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Group Art Unit : 1637 



ON APPEAL TO THE BOARD OF PATENT APPEALS AND INTERFERENCES 

APPELLANT'S BRIEF 



Mail Stop Appeal Brief - Patents 

COMMISSIONER FOR PATENTS 
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Alexandria, VA 22313-1450 



The Applicants appeal the rejection of Claims 27-28, 32-34, and 48-54 in the above- 
captioned patent application. These claims were rejected in a Final Office Action mailed April 
25, 2005. Applicants filed a Notice of Appeal July 22, 2005. Applicants file concurrently 
herewith an Amendment After Final Office Action. 

I. REAL PARTY IN INTEREST 

Pursuant to 37 C.F.R. 41.37(c)(1), Appellants hereby notify the Board of Patent Appeals 
and Interferences that the real party in interest is the assignee of record for this application, 
Genentech, Inc., 1 DNA Way, South San Francisco, CA 94080. 



II. RELATED APPEALS AND INTERFERENCES 

A Notice of Appeal has been filed in the related Application No. 10/033,396. Appellants 
are unaware of any other related appeals or interferences. 
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Dear Sir: 



01 FC-.1402 



500.00 OP 



- 1 - 



AppL No. : 10/032,996 

Filed : December 27, 2001 



III. STATUS OF THE CLAIMS 

The above-captioned application was filed with Claims 1-21. Claims 1-21 were canceled 
and new Claims 22-34 were added in a Preliminary Amendment mailed December 27, 2001. 
Claims 22-26 were canceled and new Claims 35-45 were added in an Amendment and Response 
to Office Action dated June 16, 2004. Claims 36 and 41-45 were canceled and new Claims 46- 
47 were added in an Amendment and Response to Final Office Action dated December 6, 2004, 
however an Advisory Action mailed December 20, 2004 stated that the amendments were not 
entered by the Examiner. Claims 36 and 41-45 were again canceled and new Claims 46-54 were 
added in a Submission Filed with Request for Continued Examination mailed March 4, 2005. 
Claims 27-35, 37-40 and 46-54 were finally rejected by the Examiner in a final Office Action 
mailed April 25, 2005. Appellants file concurrently herewith an Amendment After Final Office 
Action amending Claims 27 and 48 to delete elements (b)-(d) and canceling Claims 29-31, 35 
and 37-47 without prejudice to, or disclaimer of, the subject matter contained therein. 
Accordingly, Claims 27-28, 32-34 and 48-54 are the subject of this appeal. The claims attached 
hereto as Appendix A reflect the claims as amended by the Amendment After Final Office 
Action filed concurrently herewith. 

IV. STATUS OF AMENDMENTS 

Appellants mailed a "Request for Continued Examination" on March 4, 2005, canceling 
Claims 36 and 41-45 and adding new Claims 46-54, and offering additional evidence in support 
of Appellants' arguments. In a Final Office Action mailed April 25, 2005, the Examiner 
indicated that the submission filed on March 8, 2005 had been entered. Appellants file 
concurrently herewith an Amendment After Final Office Action amending claims 27 and 48 to 
delete elements (b)-(d) and canceling Claims 29-31, 35 and 37-47. 

V. SUMMARY OF THE CLAIMED SUBJECT MATTER 

The claimed subject matter relates to isolated polypeptides related to the polypeptide 

having SEQ ID NO:7. Independent Claims 27 and 48 read: 

27. An isolated polypeptide comprising: 

(a) the amino acid sequence of the polypeptide of SEQ ID 
NO:7;or 
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(b) the amino acid sequence of the polypeptide encoded by the 
full-length coding sequence of the cDNA deposited under ATCC 
accession number 203661. 

48. An isolated polypeptide having at least 95% amino acid sequence 
identity to: 

(a) the amino acid sequence of the polypeptide of SEQ ID 
NO:7; or 

(b) the amino acid sequence of the polypeptide encoded by the 
full-length coding sequence of the cDNA deposited under ATCC 
accession number 203661 ; 

wherein said isolated polypeptide or a fragment thereof can be 
used to generate an antibody which can be used to specifically detect the 
polypeptide of SEQ ID NO: 7 in lung or colon tissue samples. 

Various aspects of the claimed polypeptides are described in the substitute specification 
at, for example, page 2, line 20 through page 5, line 4; page 10, line 4 through page 12, line 24; 
page 30, lines 16-23; page 32, line 29 through page 34, line 19; page 34, lines 26-29; page 35 
line 20 through page 36, line 25; page 37, line 16 through page 39, line 4; page 42, lines 13-22; 
page 43, lines 6-11; page 44, lines 0-6; page 45, line 14 through page 46, line 36; page 68, lines 
1-15 and 24-29; page 69, line 36 through page 79, line 20; page 83, lines 17-20; page 87, line 35 
through page 95 line 32; page 100, line 29 through page 101, line 13; page 108, line 20 through 
page 114, line 54; page 118, line 23 through page 125, line 10; page 126, line 32 through page 
127, line 29; and Figures 3 and 4. SEQ ID NO: 7 is disclosed in the Sequence Listing appended 
to the application. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL AND GROUPING 
OF CLAIMS 

A. Grounds of Rejection on Appeal 

The Examiner has rejected Claims 27-28, 32-34 and 48-54 under 35 U.S.C. §101, stating 
that the claimed invention is not supported by either a specific and substantial asserted utility or a 
well-established utility. 

The Examiner has rejected Claims 48-54 under 35 U.S.C. § 112, first paragraph, as 
lacking an adequate written description, stating that "there is no record of description which 
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would demonstrate conception of any proteins other than those expressly disclosed which 
comprise SEQ ID NO:7." 

The Examiner has also rejected Claims 27-28, 32-34 and 48-54 under 35 U.S.C. §112, 
first paragraph as lacking an enabling disclosure, asserting that "it would require undue 
experimentation for one of skill in the art to perform the method of the claim as broadly written." 

B. Grouping of Claims 

1. Utility Rejection - Claims 27-28. 32-34 and 48-54 

For purposes of the utility rejection under 35 U.S.C. § 101, Claims 27-28, 32-34 and 48- 
54 can be considered as a group. 

2. Written Description Rejection - Claims 48-54 

a. Group 1 - Claim 48 

For purposes of the written description rejection under 35 U.S.C. § 112, first paragraph, 
Claim 48 should be considered individually. 

b. Group 2 - Claim 49 

For purposes of the written description rejection under 35 U.S.C. § 112, first paragraph, 
Claim 49 should be considered individually. 

c. Group 3 - Claim 50 

For purposes of the written description rejection under 35 U.S.C. § 112, first paragraph, 
Claim 50 should be considered individually. 

d. Group 4 - Claim 51 

For purposes of the written description rejection under 35 U.S.C. § 112, first paragraph, 
Claim 5 1 should be considered individually. 

e. Group 5 - Claim 52 

For purposes of the written description rejection under 35 U.S.C. § 112, first paragraph, 
Claim 52 should be considered individually. 

/ Group 6 - Claim 53-54 

For purposes of the written description rejection under 35 U.S.C. § 112, first paragraph, 
Claims 53-54 should be considered as a group. 
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3. Enablement Rejection - Claims 27-28. 32-34 and 48-54 

a. Group 1 - Claims 27-28. and 32-34 

For purposes of the enablement rejection under 35 U.S.C. § 1 12, first paragraph, Claims 
27-28, and 32-34 can be considered as a group. 

b. Group 2 - Claims 48 and 53-54 

For purposes of the enablement rejection under 35 U.S.C. § 1 12, first paragraph, Claims 

48 and 53-54 can be considered as a group. 

c. Group 3 - Claim 49 

For purposes of the enablement rejection under 35 U.S.C. § 112, first paragraph, Claim 

49 should be considered individually. 

d. Group 4 - Claim 50 

For purposes of the enablement rejection under 35 U.S.C. § 112, first paragraph, Claim 

50 should be considered individually. 

e. Group 5 - Claim 51 

For purposes of the enablement rejection under 35 U.S.C. § 112, first paragraph, Claim 

51 should be considered individually. 

/ Group 6 - Claim 52 
For purposes of the enablement rejection under 35 U.S.C. § 112, first paragraph, Claim 

52 should be considered individually. 

VII. APPELLANTS' ARGUMENT 

A. Summary of the Arguments 

1. Utility Rejection 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility." See Examination Guidelines, 66 Fed. Reg. 1092 (2001). 
Appellants have asserted that the claimed polypeptides related to the polypeptide of SEQ ID NO: 
7 (the PR0539 polypeptide) are useful as diagnostic tools for cancer, particularly for lung and 
colon cancer. This asserted utility is specific, substantial, and credible. 

Briefly stated, Appellants' asserted utility is based on the disclosure in Example 16 of the 

instant application that the gene encoding the PR0539 polypeptide is amplified at least two-fold 
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in the majority of the lung and colon tumors tested compared to normal tissue. It is well- 
established that gene amplification is correlated with an overexpression of the corresponding 
mRNA and the encoded polypeptide. Thus, one of skill in the art would be more likely than not 
to believe that the amplification of the PR0539 gene in a majority of lung and colon tumors 
leads to an increase in PR0539 mRNA and protein in these lung and colon tumors compared to 
their normal tissue counterpart. This differential expression of PR0539 mRNA and polypeptide 
is useful for distinguishing lung or colon tumor tissue from its normal tissue counterpart. 
Therefore, the claimed polypeptides related to the PR0539 polypeptide have a specific, 
substantial and credible utility as diagnostic tools for cancer, particularly lung and colon cancer, 
as is explained in more detail below. 

2. Written Description Rejection 

The second issue before the Board is whether the claimed subject matter is described in 
the specification in such a way as to reasonably convey to one skilled in the art that the inventors 
had possession of the claimed invention at the time the application was filed. The Examiner has 
rejected Claims 48-54 under 35 U.S.C. § i 12, first paragraph, as lacking an adequate written 
description, stating that "there is no record of description which would demonstrate conception 
of any proteins other than those expressly disclosed which comprise SEQ ID NO:7." Office 
Action at 7. 

The Examiner has the initial burden of rebutting the presumption that the written 
description is adequate. He has failed to meet this initial burden because nowhere in the Final 
Office Action does the Examiner address his arguments to Claims 48-54 which were added in 
Appellants' previous amendment, and the arguments he has made are either flawed, or do not 
apply to Claims 48-54. For the reasons detailed below, Appellants submit that Claims 48-54 are 
adequately described such that one of skill in the art would recognize that the inventors had 
possession of the claimed invention at the time the application was filed. 

3. Enablement Rejection 

The third issue before the Board is whether Appellants have enabled the pending claims 
such that one of skill in the art would be able to make and use the claimed invention. The 
Examiner has rejected Claims 27-28, 32-34 and 48-54 under 35 U.S.C. §112, first paragraph, 
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arguing that the claimed subject matter was not described in the specification in such a way as to 
enable one skilled in the art to make and/or use the invention. Office Action at 10. The 
Examiner recites the factors for determining enablement from In re Wands, 858 F.2d 731, 8 
U.S.P.Q. 2d 1400 (Fed. Cir. 1988), relying to a large extent on the arguments made above in 
support of the rejection for lack of utility. 

Appellants submit that Claims 27-28, 32-34 and 48-54 are enabled such that one of skill 
in the art could make and use the claimed polypeptides without undue experimentation. With 
respect to Claims 27-28 and 32-34, how to make the polypeptide of SEQ ID NO: 7 and the 
polypeptide encoded by the cDNA deposited under ATCC accession number 203661 is within 
the skill in the art. Similarly, with respect to Claims 48-54, it is well within the skill of those in 
the art to make polypeptides that are at least 95% identical to SEQ ID NO: 7 and the polypeptide 
encoded by ATCC 203661, and it is well within the knowledge of those skilled in the art how to 
make antibodies which are specific to a disclosed sequence. See also In re Wands, 858 F.2d 731 
(reversing the Board's decision of non-enablement and holding that as of 1980, undue 
experimentation was not required to make high-affinity monoclonal antibodies to a target 
peptide). Thus, one of skill in the art would be able to make the claimed polypeptides without 
undue experimentation, and the Examiner has not made any arguments to the contrary. 

Appellants assert that the claimed polypeptides are useful as diagnostic tools for cancer, 
particularly lung and colon cancer based on the amplification of the PR0539 gene in lung and 
colon tumors. As detailed below, it is well-established that gene amplification is correlated with 
an overexpression of the encoded polypeptide, and thus it is likely that the PR0539 polypeptide 
is overexpressed in lung and colon cancer. Thus, based on the disclosure in the application, one 
of skill in the art would be able to use the claimed polypeptides as diagnostic tools to distinguish 
suspected lung or colon tumors from normal tissue without undue experimentation. 

B. Utility Rejection - Detailed Arguments 

The first issue before the Board is whether Appellants have asserted at least one "specific, 
substantial, and credible utility." See Examination Guidelines, 66 Fed. Reg. 1092 (2001). 
Appellants have asserted that the claimed polypeptides related to the polypeptide of SEQ ID NO: 
7 (the PR0539 polypeptide) are useful as diagnostic tools for cancer, particularly for lung and 
colon cancer. This asserted utility is specific, substantial, and credible. 
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L Utility - Legal Standard 

A "specific utility" is defined as utility which is "specific to the subject matter claimed/' 
in contrast to "a general utility that would be applicable to the broad class of the invention." See 
M.P.E.P. § 2107.01 I. For example, it is generally not enough to state that a nucleic acid is 
useful as a diagnostic tool without also identifying the condition that is to be diagnosed. 

The requirement of "substantial utility" defines a "real world" use, and derives from the 
Supreme Court's holding in Brenner v. Manson, 383 U.S. 519, 534 (1966) stating that "[t]he 
basic quid pro quo contemplated by the Constitution and the Congress for granting a patent 
monopoly is the benefit derived by the public from an invention with substantial utility." In 
explaining the "substantial utility" standard, M.P.E.P. § 2107.01 cautions, however, that Office 
personnel must be careful not to interpret the phrase "immediate benefit to the public" or similar 
formulations used in certain court decisions to mean that products or services based on the 
claimed invention must be "currently available" to the public in order to satisfy the utility 
requirement. "Rather, any reasonable use that an applicant has identified for the invention that 
can be viewed as providing a public benefit should be accepted as sufficient, at least with regard 
to defining a 'substantial' utility." M.P.E.P. § 2107.01 (emphasis added). 

Indeed, the Guidelines for Examination of Applications for Compliance With the Utility 
Requirement, set forth in M.P.E.P. § 2107 11(B)(1) gives the following instruction to patent 
examiners: "If the applicant has asserted that the claimed invention is useful for any particular 
practical purpose . . . and the assertion would be considered credible by a person of ordinary skill 
in the art, do not impose a rejection based on lack of utility." 

Finally, in assessing the credibility of the asserted utility, the M.P.E.P. states that "to 
overcome the presumption of truth that an assertion of utility by the applicant enjoys" the PTO 
must establish that it is "more likely than not that one of ordinary skill in the art would doubt (i.e., 
'question') the truth of the statement of utility." M.P.E.P. § 2107.02 III A. 

2. Utility - Burden of Proof 

It is well established that a specification which contains a disclosure of utility which 
corresponds in scope to the subject matter sought to be patented "must be taken as sufficient to 
satisfy the utility requirement of § 101 for the entire claimed subject matter unless there is reason 
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for one skilled in the art to question the objective truth of the statement of utility or its scope." In 
re Lunger, 503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). Thus "the PTO has the 
initial burden of challenging a presumptively correct assertion of utility in the disclosure." In re 
Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). Only after the PTO provides 
evidence showing that one of ordinary skill in the art would reasonably doubt the asserted utility 
does the burden shift to the applicant to provide rebuttal evidence sufficient to convince such a 
person of the invention's asserted utility. Id. 

3. Utility — Standard of Proof 

Compliance with 35 U.S.C. § 101 is a question of fact. Raytheon v. Roper, 724 F.2d 951, 

956, 220 U.S.P.Q. 592, 596 (Fed. Cir. 1983). The evidentiary standard to be used throughout ex 

parte examination in setting forth a rejection is a preponderance of the evidence, or "more likely 

than not" standard. In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 (Fed. Cir. 

1992). This is stated explicitly in the M.P.E.P.: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P.E.P. § 2107.02, part VII (emphasis in original, 
citations omitted). 

The Court of Appeals for the Federal Circuit has stated that the standard for satisfying the 

utility requirement is a low one: 

The threshold of utility is not high : An invention is '^useful" under section 101 if 
it is capable of providing some identifiable benefit. See Brenner v. Manson, 383 
U.S. 519, 534, 86 S.Ct. 1033, 16 L.Ed.2d 69 (1966); Brooktree Corp. v. Advanced 
Micro Devices, Inc., 977 F.2d 1555, 1571 (Fed. Cir. 1992) ("To violate § 101 the 
claimed device must be totally incapable of achieving a useful result"); Fuller v. 
Berger, 120 F. 274, 275 (7th Cir. 1903) (test for utility is whether invention "is 
incapable of serving any beneficial end"). Juicy Whip, Inc. v. Orange Bang, Inc., 
185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999) (emphasis added). 

The low threshold for satisfying the utility requirement is reflected in the standard set by the 
Federal Circuit for invalidating a patent based on a lack of utility: "[T]he fact that an invention 
has only limited utility and is only operable in certain applications is not grounds for finding lack 
of utility. Some degree of utility is sufficient for patentability. Further, the defense of non- 
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utility cannot be sustained without proof of total incapacity ." Envirotech Corp. v. Al George, 

Inc., 730 F.2d 753, 762, 221 U.S.P.Q. 473 (Fed. Cir. 1984) (emphasis added, citations omitted). 

Because the standard for satisfying the utility requirement is so low, requiring total 

incapacity for a finding of no utility, the M.P.E.P. cautions that: 

Rejections under 35 U.S.C. 101 have been rarely sustained by federal courts. 
Generally speaking, in these rare cases, the 35 U.S.C. 101 rejection was sustained 
[] because the applicant . . . asserted a utility that could only be true if it violated a 
scientific principle, such as the second law of thermodynamics, or a law of nature, 
or was wholly inconsistent with contemporary knowledge in the art. M.P.E.P. § 
2107.02 III B., citing In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 
(C.C.P.A. 1967) (underline emphasis in original, italic emphasis added). 

4. Appellants Asserted a Specific. Substantial and Credible Utility that is 
Sufficient to Satisfy the Utility Requirement ofS 101 

The claimed subject matter relates to polypeptides related to the polypeptide having SEQ 
ID NO: 7. The polypeptide of SEQ ID NO: 7 (referred to as "PR0539 polypeptide") is encoded 
by the polynucleotide of SEQ ID NO: 6 (also referred to as DNA47465-1561). Specification at 
34, lines 26-29. 1 Appellants have asserted that the claimed polypeptides are useful as diagnostic 
tools for cancer, particularly lung and colon cancer. 

In "Example 16: Gene Amplification" Appellants disclose that the gene encoding 
PR0539 polypeptide is amplified at least 2-fold in a majority of lung and colon tumors tested 
compared to normal tissue, respectively (a ? Ct value of 1 corresponds to a 2-fold increase, 2 ? Ct 
corresponds to 4-fold, 3 ? Ct corresponds to 8-fold, etc.). Specification at 108, line 20 through 
page 114, line 54. As explained on page 108, line 28 through page 110, line 13 of the 
specification, the amplification of the PR0539 gene was detected using the well-established 
technique of quantitative TaqMan PCR amplification of genomic DNA isolated from lung and 
colon tumors. See also, Declaration of Audrey Goddard t Ph.D. 

The specification teaches that amplification of a PRO polypeptide-encoding gene in one 
or more tumor tissues as compared normal tissue "is associated with overexpression of the gene 
product, indicating that the polypeptides are useful targets for therapeutic intervention... and 



1 All citations to the Specification are to the substitute specification mailed on March 27, 
2002 as subsequently amended. 
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diagnostic determination of the presence of those cancers." Specification at 108, lines 22-25. 

Likewise, Appellants disclose the use of antibodies to PRO polypeptides as diagnostic tools: 

[A]nti-PRO antibodies may be used in diagnostic assays for PRO, e.g., detecting 
its expression in specific cells, tissues, or serum. Various diagnostic assay 
techniques known in the art may be used, such as competitive binding assays, 
direct or indirect sandwich assays and immunoprecipitation assays conducted in 
either heterogeneous or homogeneous phases. Specification at 95, lines 9-12. 

Taken together, the specification clearly discloses the use of the claimed polypeptides as 
diagnostic tools for cancer, for example by detection with specific antibodies, particularly lung 
and colon cancer. This utility is substantial, as one of skill in the art will recognize that the 
diagnosis of cancer is a "real world" use; it is specific, as the diagnosis of lung and colon cancer 
is not a utility that applies to the broad class of polypeptides; and it is credible, as it is not a 
utility "that could only be true if it violated a scientific principle, ...or a law of nature, or [is] 
wholly inconsistent with contemporary knowledge in the art." M.P.E.P. § 2107.02 III B., citing 
In re Gazave, 379 F.2d 973, 978, 154 U.S.P.Q. 92, 96 (C.C.P.A. 1967). 

Because Appellants' specification contains a disclosure of utility which corresponds in 
scope to the claimed subject matter, polypeptides related to SEQ ID NO:7, and those which can 
be used to make antibodies which specifically bind to the polypeptide of SEQ ID NO: 7, the 
asserted utility "must be taken as sufficient to satisfy the utility requirement of § 101 for the 
entire claimed subject matter unless there is reason for one skilled in the art to question the 
objective truth of the statement of utility or its scope." In re Langer, 503 F.2d 1380, 1391, 183 
U.S.P.Q. 288, 297 (C.C.P.A. 1974). Therefore, the burden of establishing a prima facie case of 
lack of utility rests with the PTO. See, In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 
(Fed. Cir. 1995) ("the PTO has the initial burden of challenging a presumptively correct assertion 
of utility in the disclosure"). 

5. The Examiner's Arguments 

To establish a prima facie showing that the claimed subject matter lacks utility, the 
Examiner must "provide[] evidence showing that one of ordinary skill in the art would 
reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 
(Fed. Cir. 1995). The Examiner has filed a first Office Action on March 19, 2004, a final Office 
Action on September 7, 2004, an Advisory Action on December 20, 2004, and a second final 
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Office Action on April 25, 2005 after Appellants' Request for Continued Examination. None of 
these papers provide sufficient evidence that one of ordinary skill in the art would reasonably 
doubt the asserted utility. 

The final Office Action on April 25, 2005, contains all of the Examiner's arguments in 
support of the utility rejection made in the previous office actions. Appellants summarize these 
arguments below. 

a. Appellants' evidence is allegedly deficient 

The Examiner has asserted that that the evidence of PR0539 gene amplification is 
deficient, stating: 

The level of overexpression of PRO 539 was minimal, not even a two fold 
overexpression in any cell type. Further, no statistical data was presented to show 
that the overexpression was significant in any way with no P-value or other 
statistical measure to demonstrate that, the overexpression was a real effect and 
not simply produced by chance. Office Action at 3 2 . 

b. The Examiner asserts that there is no "necessary" connection between 
DNA amplification. mRNA level, and protein level 

The critical argument for the Examiner's rejection is that "the overexpression of the 
nucleic acid is not relevant to the utility of the protein." Office Action at 3. The Examiner cites 
Meric et al (Mol. Cancer Ther. (2002) 1:971-979) Gokman -Polar et al (Cancer Res. (2001) 
61:1375-1381) and Pennica et al (Proc. Natl. Acad. Sci. USA (1998) 95:14717-14722) to 
support his assertion that "not only is there no necessary connection between the level of protein 
in a cell and the amount of mRNA, but there is also no necessary correlation between the amount 
of DNA in a cell and the amount of mRNA." Office Action at 4 (emphasis added). Therefore, 
the Examiner concludes that "any evidence by Applicant showing overexpression of one 
component does not provide utility for the protein itself." Id, at 4. 



2 Citation to "Office Action" refers to the final Office Action mailed April 25, 2005 
unless otherwise indicated. 
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c. The Examiner asserts that a protein's function cannot be predicted from 
its sequence 

The Examiner also argues that "given the breadth of these claims which encompass 95% 
identical molecules, there is an abundance of evidence that very similar proteins can perform 
very different functions." Office Action at 5. The Examiner cites Rost et al (J. Mol. Biol. 
(2002) 318(2):595-608) to support the assertion that "even high levels of homology do not 
necessarily correlate with actual protein function. In the current case, where the function of 
PRO-539 (SEQ ID NO: 7) is not known, the expectation is even lower that there is any utility 
that can be derived based upon the sequence." Id. 

d. The Examiner asserts that the instant case is similar to Examples 4 and 
12 of the Utility Guidelines 

The Examiner argues that the instant case is extremely similar to Example 12 of the 
Utility Guidelines where a protein which was known to be a receptor, but where the ligand was 
unknown, was found to lack utility. The Examiner argues that "[i]n the current case, the putative 
PRO-539 protein, lacks any substantial utility whatsoever, and solely relies upon an small level 
of mRNA overexpression in cancer cells. However, there is no necessary relationship between 
the protein levels or utilities and such an overexpression of the nucleic acid." Office Action at 5 
(emphasis added). The Examiner concludes that further research would be required to identify 
and reasonably confirm a "real world" context of use for PRO-539. Without explanation, the 
Examiner also asserts that the instant case "directly tracks" Example 4 of the Utility Guidelines 
as well. Office Action at 6. 

e. The Examiner asserts that the claimed invention lacks a specific utility 
In addition to the arguments above that the claimed invention lacks a substantial utility, 

the Examiner also asserts that the claimed invention lacks a specific utility. The Examiner 
argues that "the protein, as distinguished from the nucleic acid, has not been associated with any 
disease, any condition, or any other specific feature. There is no association of the antibody or 
protein with cancer or with any other disease." Office Action at 6. The Examiner states that "a 
general statement of diagnostic utility, such as diagnosing an unspecified disease, would 
ordinarily be insufficient absent a disclosure of what condition can be diagnosed." Id. 
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6. The Examiner has not established a prima facie case that Claims 27-28. 32-34 
and 48-54 lack Utility 

The above arguments do not satisfy the Examiner's burden to "provide[] evidence 
showing that one of ordinary skill in the art would reasonably doubt the asserted utility." In re 
Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The Examiner has the burden 
of presenting "countervailing facts and reasoning sufficient to establish that a person of ordinary 
skill would not believe the applicant 's assertion of utility." M.P.E.P. §2107.02 III.A., citing in 
re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995) ("Only after the PTO 
provides evidence showing that one of ordinary skill in the art would reasonably doubt the 
asserted utility does the burden shift to the applicant to provide rebuttal evidence"). The 
Examiner's argument hinges on his assertion that there is no "necessary connection" or 
"necessary correlation" between gene amplification and mRNA or protein overexpression. The 
Examiner cites three references to support this assertion. 

The Examiner's arguments fail for two reasons. First, the lack of a "necessary," z.e,, 
precise or perfect, connection is not sufficient to establish that one of skill in the art would 
reasonably doubt Appellants' asserted utility. Appellants do not need to establish the utility 
"beyond a reasonable doubt" or to a "statistical certainty." Instead, Appellants need show that 
one of ordinary skill in the art would conclude that the asserted utility is more likely than not true . 
See M.P.E.P. § 2107.02, part VII. Thus, the lack of a "necessary" connection does not mean that 
Appellants' asserted utility is not true under a more likely than not standard. For example, if the 
presence of A results in the presence of B in 9 out of 10 cases there is no "necessary" connection 
between A and B, yet one would believe that it is more likely than not true that given A, B will 
be present. 

The second reason the Examiner's arguments fail is that the references relied on by the 
Examiner are not contrary to Appellants' asserted utility, and in fact actually support Appellants' 
position. Because the references relied on by the Examiner offer little or no support for his 
assertions, the Examiner has failed to establish a prima facie showing that one of skill in the art 
would reasonably doubt the asserted utility, and the Board should accept Appellants' disclosed 
utility as sufficient. Appellants address each of the Examiner's arguments in turn. 
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a. The data in Example 16 are sufficient to establish the asserted utility 
Appellants turn first to the Examiner's arguments challenging the reliability of the data 
reported in Example 16. The Examiner makes two arguments regarding the sufficiency of the 
data in Example 16: (1) he argues that the level of overexpression of PROS 3 9 was minimal, "not 
even a two fold overexpression in any cell type;" and (2) that there is no statistical analysis to 
show that the data were significant. Office Action at 3. Appellants submit that the gene 
amplification data provided in Example 16 are sufficient to establish a specific and substantial 
utility for PR0539 and related polypeptides. 

As described in Example 16 of the present application, the inventors isolated genomic 
DNA from a variety of primary cancers and cancer cell lines. As a negative control, DNA was 
isolated from the blood of normal healthy individuals. Specification at 108, lines 29-31. Gene 
amplification was monitored using real-time quantitative TaqMan™ PCR. The gene 
amplification results are set forth in Table 7 (Table 8 as amended) on page 114. As explained in 
the specification on page 109, lines 0-2, the results of TaqMan™ PCR are reported in ACt units. 
It is well-known in the art that "Ct" stands for "threshold cycle." One Ct unit corresponds to one 
PCR cycle or approximately a 2-fold amplification, relative to control, 2 units correspond to 4- 
fold amplification, 3 units to 8-fold amplification, etc. Id. 

Looking at the results reported in on page 114, nine primary lung tumors and eight 
primary colon tumors were tested, as well as a number of tumor cell lines. PR0539 had a ACt 
value of greater than 1 , i.e. t more than two-fold amplification , in six of nine lung tumors and five 
of eight colon tumors. These data show that in more than half of the lung and colon tumors 
tested, the gene for PR0539 was amplified at least two-fold. Thus, the Examiner's statement 
that the overexpression of PR0539 was minimal, "not even a two fold overexpression in any cell 
type" is simply wrong. 

The Examiner has also rejected the data because he questions the statistical significance 

of the data. However, Appellants are not required to prove utility to a statistical certainty, only 

that it is more likely than not true. See Nelson v. Bowler, 626 F.2d 853, 856-57, 206 U.S.P.Q. 

881, 883-84 (C.C.P.A. 1980) (reversing the Board and rejecting an argument that evidence of 

utility was insufficient because it was not statistically significant). As the M.P.E.P. states: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
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statistical certainty. Instead, evidence will be sufficient if, considered as a 
whole, it leads a person of ordinary skill in the art to conclude that the asserted 
utility is more likely than not true . M.P.E.P. § 2107.02, part VII (bold emphasis 
added, underline in original, citations omitted). 

Therefore, whether the results are statistically significant or not is irrelevant to 
establishing the asserted utility. The results must simply be reliable enough that one of skill in 
the art would believe that the utility is more likely than not true. 

In support of Appellants' assertion of utility, Appellants submitted a copy of the 
declaration of Dr. Audrey Goddard with exhibits A-G (the Goddard Declaration). As Dr. 
Goddard's curriculum vitae, Exhibit A of the Goddard Declaration, shows, she is an expert in the 
art of identifying and quantifying the amplification of oncogenes in cancers. 

In her declaration, Dr. Goddard states that 

[T]he quantitative TaqMan PCR technique is technically sensitive enough to 
detect at least a 2-fold increase in gene copy number relative to control. It is 
further my considered scientific opinion that ... a gene identified as being 
amplified at least 2-fold by the quantitative TaqMan PCR assay in a tumor sample 
relative to a normal sample is useful as a marker for the diagnosis of cancer , for 
monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. Goddard Declaration at \1 (emphasis added). 

Therefore, according to Dr. Goddard, an expert in the field, a 2-fold or more increase, i.e., 
a ACt value of 1 or greater, is significant and reliable enough to be useful for the diagnosis of 
cancer. The ACt value for PR0539 is greater than 1 for the majority of primary lung and colon 
tumors tested, indicating a more than two- fold amplification. Specification at page 1 14. 

The Examiner has not offered any substantial reason or evidence to reject either the 
underlying data or Dr. Goddard's conclusions. Instead, the Examiner has dismissed the Goddard 
declaration as "fundamentally flawed," arguing the it fails to provide specific evidence regarding 
PR0539. Final Office Action mailed 9/7/04 at 14. While it is true that Dr. Goddard's 
Declaration discusses the significance of the data generically without specific reference to 
PR0539, her declaration read in light of the data reported in Example 16 clearly make her 
declaration applicable to PR0539. 

Dr. Goddard is an expert in the field whose declaration is based on personal knowledge 
of the relevant facts at issue. Appellants' have reminded the Examiner that "Office personnel 
must accept an opinion from a qualified expert that is based upon relevant facts whose accuracy 
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is not being questioned." M.P.E.P. § 2107 (emphasis added). In addition, declarations relating 
to issues of fact should not be summarily dismissed as "opinions" without an adequate 
explanation of how the declaration fails to rebut the Examiner's position. See in re Alton 76 F.3d 
1168 (Fed. Cir. 1996). Therefore, the Examiner should accept Dr. Goddard's statement that "a 
gene identified as being amplified at least 2-fold by the quantitative TaqMan PCR assay in a 
tumor sample relative to a normal sample is useful as a marker for the diagnosis of cancer." 

Interestingly, the above argument along with the Goddard Declaration were presented to 
Examiner Fredman in the closely related and co-owned patent application Serial No. 10/033,167 
to overcome a nearly identical 35 U.S.C. § 101 utility rejection of claims directed in part to the 
nucleic acid sequence encoding the PR0539 protein. That application contains the same data 
presented and relied on in the instant application. In that case, the Examiner concluded that the 
Goddard declaration was "sufficient to overcome the rejection of the claims based upon utility 
and enablement as discussed above." Office Action mailed 9/9/2003, page 5. 

Thus, it is the uncontested opinion of an expert in the field that the results in Example 16 
are significant and reliable enough to indicate that the PR0539 gene is useful as a diagnostic tool 
for cancer, and the PTO has accepted this utility in a related application. 

In conclusion, Appellants submit that the evidence reported in Example 1 6, supported by 
the Goddard Declaration, establish that the PR0539 gene is amplified at least two-fold in the 
majority of lung and colon tumors tested and can be used as a marker for the diagnosis of cancer. 
The Examiner has not offered any significant arguments or evidence to the contrary, and 
therefore has not established a prima facie case that one of skill in the art would reasonably 
doubt the significance of the data in Example 16, or the assertion that the PR0539 gene can be 
used as a diagnostic tool. For the reasons discussed below, based on these data Appellants' have 
established a specific and substantial utility for the PR0539 polypeptide as well. 

b. The Examiner's cited references are not contrary to Appellants' 
assertion that uene amplification generally leads to increased mRNA 
and protein leyels 

Appellants next address the heart of the Examiner's rejection - his assertion that "not 
only is there no necessary connection between the level of protein in a cell and the amount of 
mRNA, but there is also no necessary correlation between the amount of DNA in a cell and the 
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amount of mRNA. Therefore, any evidence by Applicant showing overexpression of one 
component does not provide utility for the protein itself Office Action at 4 (emphasis added). 

As an initial matter, Appellants emphasize that the level of proof required to establish 
utility is relatively low. "[T]he applicant does not have to provide evidence sufficient to 
establish that an asserted utility is true 'beyond a reasonable doubt.' Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of statistical certainty." 
M.P.E.P. § 2107.02, part VII (citations omitted). Therefore, there does not have to be a 
"necessary," precise or perfect correlation between gene amplification and increased mRNA and 
protein levels. Instead, the evidence must be sufficient to convince a skilled artisan that the 
asserted utility is "more likely than not true." Id, 

Thus, even if the references cited by the Examiner supported the conclusion that there is 
no "necessary" connection between gene amplification, mRNA, and protein levels, this is not the 
requisite standard which Appellants must meet to satisfy their burden. Instead, there need only 
be evidence that the asserted utility is "more likely than not true." As discussed above, it does 
not follow that one of skill in the art would not believe that the asserted utility is more likely than 
not true even //"the connection between gene amplification and the level of mRNA and protein is 
not "necessary." Stated another way, the correlation between gene amplification and the level of 
mRNA and protein does not have to be exact for the asserted utility to be more likely than not 
true. 

Keeping the standard of proof in mind, we turn to the three references the Examiner cites 
as support for his assertion that there is no "necessary" connection between gene amplification, 
the amount of mRNA and the amount of protein. The first is an article by Meric et al The 
Examiner states that "in a discussion of regulation of gene activity in cancer [Meric] notes that 
'Gene expression is quite complicated, however, and is also regulated at the level of mRNA 
stability, mRNA translation and protein stability (page 971, column 1).'" Office Action at 3. 
From this statement, the Examiner concludes that "Meric teaches that there is not necessarily a 
correlation between mRNA levels and protein levels in cancer cells, since the regulation may 
occur at levels other than that of the mRNA, such as in the level of translation of the mRNA or in 
the stability of the protein." Office Action at 3-4. 

What the Examiner ignores is the preceding statement by the authors: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
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efforts have concentrated on identifying differences in gene expression at the 
level of mRNA , which can be attributable to either DNA amplification or to 
differences in transcription. Meric at 971 (emphasis added). 

This statement supports Appellants' asserted utility. First, Meric teaches that although 
there is more than one mechanism, changes in mRNA in tumors cells can be attributed to gene 
amplification. This clearly supports Appellants' assertion that gene amplification leads to 
increase in mRNA level. 

Second, Meric states that examining changes in mRNA levels is a "fundamental 
principle" of molecular cancer therapeutics. It is true that there is no necessary , i.e., absolute or 
perfect, correlation between gene expression and protein expression because there are other 
mechanism for regulating gene expression. However, exploiting differences in gene expression 
between cancer cells and normal cells would not be a "fundamental principle" of molecular 
cancer therapeutics if there were no significant correlation between gene expression and protein 
levels. Stated another way, changes in mRNA without corresponding changes in protein levels 
would have little or no effect on cellular biology, and those of skill in the art would have no 
reason to examine the differences in gene expression at the mRNA level without such a 
correlation. However, as one of skill in the art recognizes, there is a strong correlation between 
changes in mRNA and changes in protein level. It is because of this strong correlation that it 
remains a "fundamental principle" of molecular therapeutics in cancer to look at changes in 
mRNA level. 

Thus, while Meric does teach that there is no "necessary" correlation between changes in 
mRNA and changes in protein, it supports Appellants' assertion that generally there is a good 
correlation between the two. Appellants reiterate that they need not prove an asserted utility is 
true "beyond a reasonable doubt" or as a matter of "statistical certainty." The evidence must 
lead a person of ordinary skill in the art to conclude that the asserted utility is " more likely than 
not true ." M.P.E.P. § 2107.02, part VII. Because the correlation does not have to be absolute for 
one of skill in the art to believe the asserted utility is more likely than not true, Meric does not 
support the Examiner's rejection, and does not satisfy his burden of presenting "countervailing 
facts and reasoning sufficient to establish that a person of ordinary skill would not believe the 
applicant's assertion of utility." M.P.E.P. §2107.02 III.A., citing in re Brana, 51 F.3d 1560, 
1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). 
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The Examiner's second citation is an article by Gokman-Polar. The Examiner asserts 
that the article "teaches 'Quantitative reverse transcription-PCR analysis revealed that PKC 
mRNA levels do not directly correlate with PKC protein levels, indicating that PKC isozyme 
expression is likely regulated at the posttranscriptional/translational level (see abstract).'" Office 
Action at 4. The Examiner states that Gokman-Polar shows in figures 6 and 7 that "there is no 
increase in mRNA expression for any of the isozymes, while the protein is significantly 
overexpressed as shown by figures 4 and 5. This demonstrates that there is no relationship 
between mRNA levels and protein levels." Office Action at 4 (emphasis added). 

Appellants submit that like Meric, Gokman-Polar is not contrary to Appellants' assertion 
that gene amplification generally leads to overexpression of the mRNA and protein. Appellants 
emphasize that they are asserting that gene amplification leads to an increase in PR0539 mRNA, 
which results in increased PR0539 protein in the tumors where PR0539 is amplified. 
Appellants are not asserting that every change in protein level is a result of gene amplification or 
an increase in mRNA expression - Appellants acknowledge that there are other ways to increase 
protein levels. Therefore, the lack of an large increase in PKC 611 mRNA accompanying an 
increase in PKC fill protein levels, cited by the Examiner as support for his rejection, is not very 
relevant to Appellants' asserted utility since it does not establish that increased mRNA does not 
lead to increased protein. Instead, it shows that not all protein increases are a result of increased 
mRNA. 

In addition, a close review of the entire Gokman-Polar article indicates that the trend in 
the data is that changes in mRNA levels are positively correlated with protein levels, supporting 
Appellants' assertion that the two are generally correlated. In Figure 6, the mRNA levels of two 
isozymes shows a decrease, while the third is increased slightly. This same pattern is seen for 
the corresponding protein levels in Figure 2, although as the Examiner points out, the increase in 
mRNA for the third isozyme is small compared to the increase in protein level. Similarly, 
comparing the mRNA levels of the three isozymes in Figure 7 to the corresponding protein 
levels in Figure 4, with one exception the changes in mRNA levels are positively correlated to 
changes in protein levels. While protein levels do not increase or decrease in direct proportion to 
the changes in mRNA, the trend in five of the six examples is that changes in mRNA levels are 
positively correlated to changes in protein levels. 
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This evidence is hardly sufficient to establish that one of skill in the art would reasonably 
doubt that there is a reasonable correlation between changes in mRNA levels and changes in the 
corresponding protein levels. First, it is evidence taken from only two genes, one of which has 
two splice variants. This is hardly sufficient evidence to refute Appellants' assertion that in 
general, increases in mRNA lead to increases in the corresponding protein. Second, even within 
this small sample, five of the six data points show a positive correlation between changes in 
mRNA and protein levels - decreases in mRNA are associated with decreases in protein, while 
increases in mRNA are associated with increases in protein. Therefore, Gokman-Polar does not 
satisfy the Examiner's burden to establish that one of skill in the art would reasonably doubt the 
asserted utility, and it definitely does not support the Examiner's obviously false statement that 
"there is no relationship between mRNA levels and protein levels." Office Action at 4 (emphasis 
added). 

Finally, the Examiner cites Pennica et al, stating that the article "shows that WISP-2 
DNA was amplified in cancer cells but was actually demonstrated REDUCED RNA expression 
(see abstract)." Office Action at 4 (emphasis in original). Based on this statement, the Examiner 
concludes that "[t]his provides additional evidence that there is no relationship between gene 
amplification and mRNA levels, since mRNA levels have no necessary correlation with gene 
amplification." Id. (emphasis added). 

The Examiner's conclusion that there is no relationship between gene amplification and 

mRNA levels is not supported by the reference. Pennica reports that: 

An analysis of WISP-1 gene amplification and expression in human colon tumors 
showed a correlation between DNA amplification and overexpression , whereas 
overexpression of WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, but its mRNA 
expression was significantly reduced in the majority of tumors compared with the 
expression in normal colonic mucosa from the same patient." Pennica at 14722, 
col. 1 (emphasis added). 

Importantly, the authors report that the WISP-1 gene was amplified approximately 2-fold, similar 
to the gene for PR0539 reported in Example 16. Id. at 14720, col. 2. 

Appellants first note that overexpression of WISP-3 RNA seen in the absence of DNA 
amplification is not relevant to the Appellants' asserted utility - Appellants are not asserting that 
amplification is the only source of gene overexpression. This leaves the results reported for 
WISP-1, where gene amplification (approximately two-fold) is correlated with overexpression of 
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the mRNA, and WISP-2, where gene amplification was apparently not correlated with gene 
overexpression. 

The authors of Pennica offer an explanation for what they obviously viewed as an 
anomalous result for WISP-2: "Because the center of the 20ql3 amplicon [of which WISP-2 is a 
part] has not yet been identified, it is possible that the apparent amplification observed for 
WISP-2 may be caused by another gene in this amplicon." Pennica at 14722, col. 1 (emphasis 
added). Thus, the only example of a lack of positive correlation between gene amplification and 
RNA overexpression relied on by the Examiner may be an artifact. The fact that the authors 
attempt to explain this anomaly only supports Appellants' argument that the accepted 
understanding in the art is that there is a positive correlation between gene amplification and 
overexpression of the mRNA. 

As stated above, the standard for utility is not absolute or even statistical certainty, but 
rather whether one of skill in the art would believe that the asserted utility is more likely than not 
true. One apparent contrary example, when combined with the positive example reported in the 
same reference, is not sufficient to prove that a person of skill in the art would have a reasonable 
doubt that gene amplification is not generally correlated with increased gene expression. The 
Examiner has not shown whether the apparent lack of correlation observed for one of the two 
amplified genes studied is typical, or is merely an exception to the rule of correlation. Indeed, 
the authors' attempt to explain the lack of correlation suggests that this result is the exception, 
and not the rule. 

In summary, Pennica provides one example of a positive correlation between gene 
amplification and increased gene expression, and one possible example of a lack of correlation. 
This single reference reporting a single gene that may show a lack of correlation between gene 
amplification and mRNA overexpression cannot support the Examiner's conclusion that there is 
"no relationship between gene amplification and mRNA levels" - the results of WISP-1 clearly 
show that in some cases (at least half of those relied on by the Examiner) there is a relationship. 
For the same reason, Appellants submit that Pennica is not sufficient to satisfy the Examiner's 
initial burden of establishing that it is more likely than not that one of skill in the art would doubt 
Appellants' assertion that generally, gene amplification leads to overexpression of the mRNA 
and protein. 
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Taken together, the three references relied on by the Examiner are not sufficient to 
support his conclusion that "the overexpression of the nucleic acid is not relevant to the utility of 
the protein." Office Action at 3. First, Meric supports Appellants' assertion that generally, 
changes in mRNA lead to a corresponding change in the level of the encoded protein - that is 
why examining differences between tumor and normal tissue at the mRNA level is a 
"fundamental principle" of molecular cancer therapeutics. Likewise, Meric teaches that mRNA 
overexpression can be attributed to gene amplification. Second, Gokman-Polar reports a positive 
correlation between changes in mRNA level and changes in protein level for five of six samples 
tested. Far from supporting the Examiner's conclusion that there is "no relationship" between 
mRNA and protein, this evidence tends to support Appellants' assertion that changes in mRNA 
and protein are correlated. Finally, the only reference which looked at a correlation between 
gene amplification and gene expression reports one gene where there was a strong correlation 
between the two, and one possible example where there was a lack of positive correlation. This 
evidence is at best inconclusive, with at least half the genes showing a correlation between gene 
amplification and mRNA overexpression. 

While these references may establish that there is no "necessary" correlation between 
gene amplification and overexpression of mRNA and protein, Appellants are not required to 
establish the asserted utility beyond a reasonable doubt or with a statistical certainty. Thus, even 
assuming that the Examiner has proved that there is no "necessary" correlation, this does not 
mean that he has met his initial burden of establishing that it is "more likely than not" that a 
skilled artisan would doubt the asserted utility. Given that two of the cited references actually 
tend to support the Appellants' position, Appellants assert that the Examiner has failed to meet 
his burden of establishing a prima facie case of lack of utility. 

c. The Examiner's arguments regarding the unpredictability of protein 
function from protein sequence are irrelevant 
Appellants next address the Examiner's argument that "there is an abundance of evidence 
that very similar proteins can perform very different functions" and that "even high levels of 
homology do not necessarily correlate with actual protein function. In the current case, where 
the function of PRO-539 (SEQ ID NO: 7) is not known, the expectation is even lower that there 
is any utility that can be derived based upon the sequence." Office Action at 5. 
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Appellants' asserted utility does not rely on the function of the PR0539 protein, or any 
relation between the function of PR0539 and its sequence. The claimed subject matter relates to 
PR0539 polypeptides and polypeptides related to SEQ ID NO:7 which can be used to make 
antibodies which specifically bind to the polypeptide of SEQ ID NO:7. The Appellants' asserted 
utility is the use of the claimed polypeptides as diagnostic tools for cancer, based on the two-fold 
amplification of the PR0539 gene in a majority of the lung and colon tumors tested. This utility 
in no way depends on the function of the PR0539 protein, making the Examiner's arguments 
irrelevant. 

d. The instant case is significantly different from Examples 4 and 12 of the 
Utility Guidelines 

The Examiner argues that the instant case is extremely similar to Example 12 of the 
Utility Guidelines where a protein which was known to be a receptor, but where the ligand was 
unknown, was found to lack utility. The Examiner argues that "[i]n the current case, the putative 
PRO-539 protein, lacks any substantial utility whatsoever, and solely relies upon an small level 
of mRNA overexpression in cancer cells. However, there is no necessary relationship between 
the protein levels or utilities and such an overexpression of the nucleic acid." Office Action at 5 
(emphasis added). The Examiner also asserts that the instant case "directly tracks" Example 4 of 
the Utility Guidelines as well. Office Action at 6. 

First, as stated above, Appellants are not required to establish their asserted utility 
"beyond a reasonable doubt" or to a "statistical certainty." Therefore, any lack of a " necessary " 
relationship between protein levels and overexpression of the nucleic acid does not establish that 
one of skill in the art is more likely than not to doubt the asserted utility. Second, for the reasons 
stated above, the Examiner's references do not establish that one of skill in the art would doubt 
Appellants' assertion that gene amplification generally leads to overexpression of the mRNA and 
protein. For this reason, the instant case is significantly different from Examples 4 and 12 of the 
Utility Guidelines. 

In Example 4, nothing more than the sequence of the protein was disclosed. There was 
no disclosed utility, or description of the chemical, physical or biological properties of the 
protein other than its sequence. Similarly, in Example 12, the specification discloses that an 
isolated protein is a receptor for an uncharacterized protein. The isolated protein is not 
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characterized with respect to its biological function or any disease or body condition that is 
associated with the protein. 

In contrast to Examples 4 and 12, in the instant case the specification teaches that the 
gene encoding the PR0539 protein is amplified at least two-fold in the majority of lung and 
colon tumors tested. Appellants have asserted that one of skill in the art would recognize that it 
is likely that the PR0539 protein is therefore also overexpressed in a majority of lung and colon 
tumors, an assertion supported by expert declarations and references discussed below. This 
biological activity distinguishes the instant case from Examples 4 and 12. 

Instead, the instant case is more like the caveat to Example 12, where the additional 
information that the isolated protein is found on the cell membranes of melanoma cells, but not 
normal skin cells is disclosed. The Utility Guidelines state that this additional information is 
sufficient to establish utility for antibodies to the protein as diagnostic tools. Similarly, the likely 
overexpression of PR0539 protein in lung and colon tumors is also sufficient to establish utility 
for the claimed polypeptides related to PR0539. 

e. The asserted utility is specific 

Finally, Appellants address the Examiner's argument that "the protein, as distinguished 
from the nucleic acid, has not been associated with any disease, any condition, or any other 
specific feature. There is no association of the protein with cancer or with any other disease." 
Office Action at 6. The Examiner states that "a general statement of diagnostic utility, such as 
diagnosing an unspecified disease, would ordinarily be insufficient absent a disclosure of what 
condition can be diagnosed." Id. 

Specific Utility is defined as utility which is "specific to the subject matter claimed," in 
contrast to "a general utility that would be applicable to the broad class of the invention." 
M.P.E.P. § 2107.01, part I (2004). Appellants submit that the evidence of amplification and 
overexpression of PR0539 nucleic acids (see below) in lung and colon tumors along with the 
declarations and references discussed below provide a specific utility for the claimed 
polypeptides. As discussed in more detail below, Appellants have established a reasonable 
correlation between gene amplification and overexpression of the mRNA and protein. The 
Examiner has not provided any substantial evidence or arguments to the contrary. This makes 
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PR0539 and related polypeptides useful in diagnosing lung and colon cancer. This is not a "a 
general statement of diagnostic utility" that would apply to the broad class of polypeptides. 

/ Conclusion - Examiner has failed to establish a prima facie case that 
one of skill in the art would doubt Appellants 9 asserted utility 
The Examiner has relied on essentially two unsupported arguments in rejecting the 
pending claims for lack of utility. First, the Examiner has questioned the sufficiency, reliability 
and significance of the data reported in Example 16. Second, the Examiner has argued that 
because there is no " necessary " connection between gene amplification and mRNA levels, and 
no " necessary " connection between mRNA levels and protein levels, evidence of gene 
amplification does not provide utility for the protein. Appellants have responded to each of these 
arguments in turn. 

First, Appellants have established that the PR0539 gene is amplified at least two-fold in 
a majority of lung and colon tumors tested. The Goddard declaration establishes that the data in 
Example 16 are sufficiently reliable and significant to show that the PR0539 gene is useful as a 
cancer diagnostic tool. The Examiner has not provided any substantial reason or evidence for 
one of skill in the art to doubt the reliability or usefulness of the data in Example 16, or the facts 
and conclusions in the Goddard declaration. 

Second, Appellants have shown that the three references relied on by the Examiner as 
support for his assertion that gene amplification is not relevant to the utility of the protein and 
antibody do not support the Examiner's position, are not contrary to Appellants' assertion of a 
general correlation between gene amplification and mRNA and protein overexpression, and in 
fact lend support to Appellants' position. 

Finally, Appellants have adequately addressed the Examiners' remaining arguments 
regarding a lack of structure function-relationships in proteins; the similarity of the instant case 
to Examples 4 and 12 of the Utility Guidelines, and the asserted lack of a specific utility. 

Taken together, the Examiner's arguments are not sufficient to satisfy the Examiner's 
burden to "provide[] evidence showing that one of ordinary skill in the art would reasonably 
doubt the asserted utility." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 
1995). The Examiner's arguments are largely conclusory statements which are not supported by 
any substantial evidence or reasoning which explains why one of ordinary skill in the art would 

-26- 



Appl. No. : 10/032,996 

Filed : December 27, 2001 



reasonably doubt the asserted utility. And even the scant evidence that is provided supports the 
Appellants' position as much or more than the Examiner's position. Therefore, the Board should 
accept the Appellants' disclosure of utility. See Ex parte Rubin, 5 U.S.P.Q. 2d 1461 (Bd. Pat. 
App. & Interf. 1987) ("There is no factual support in this record for the examiner's questioning 
of the denaturation test reported in the specification. ... No reason to doubt 'the objective truth' 
of the asserted utility having been advanced by the examiner, we accept appellant's disclosure of 
utility corresponding in scope to the claimed subject matter."). 

7. Appellants have provided Sufficient Rebuttal Evidence of Utility 

"Only after the PTO provides evidence showing that one of ordinary skill in the art would 

reasonably doubt the asserted utility does the burden shift to the applicant to provide rebuttal 

evidence." In re Brana, 51 F.3d 1560, 1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). The rebuttal 

evidence must be sufficient such that when it is considered as a whole, it is more likely than not 

that the asserted utility is true. See In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 

1444 (Fed. Cir. 1992) (stating that the evidentiary standard to be used throughout ex parte 

examination in setting forth a rejection is a preponderance of the evidence, or "more likely than 

not" standard). The M.P.E.P. summarizes the standard of proof required: 

[T]he applicant does not have to provide evidence sufficient to establish that an 
asserted utility is true "beyond a reasonable doubt." Nor must the applicant 
provide evidence such that it establishes an asserted utility as a matter of 
statistical certainty. Instead, evidence will be sufficient if, considered as a whole, 
it leads a person of ordinary skill in the art to conclude that the asserted utility is 
more likely than not true . M.P.E.P. § 2107.02, part VII (emphasis in original, 
citations omitted). 

Appellants remind the Board that the Federal Circuit has stated that the standard for satisfying 
the utility requirement is a low one: "The threshold of utility is not high: An invention is 'useful' 
under section 101 if it is capable of providing some identifiable benefit." Juicy Whip, Inc. v. 
Orange Bang, Inc., 185 F.3d 1364, 1366, 51 U.S.P.Q. 2d 1700 (Fed. Cir. 1999). 

Even if the Examiner has satisfied his burden of presenting a prima facie case of lack of 
utility, Appellants have supplied more than enough rebuttal evidence, such that when considered 
as a whole, one of skill in the art would conclude that the asserted utility is more likely than not 
true. As discussed in detail below, Appellants have provided sufficient evidence that the gene 
encoding the PR0539 polypeptide is amplified and overexpressed in lung and colon tumors and 
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can therefore be used as a diagnostic tool In addition, Appellants have shown that it is well 
established in the art that there is a reasonable correlation between gene amplification, changes 
in mRNA level and changes in the corresponding protein level such that one of skill in the art 
would believe that the PR0539 polypeptide is also overexpressed in lung and colon cancers. 
Therefore, considering the evidence as a whole, one of skill in the art would believe that it is 
more likely than not that the claimed polypeptides are useful as diagnostic tools for cancer, 
particularly lung and colon tumors. 

cl Appellants have established that the PRQ539 gene is amplified at least 
two-fold in the majority of lung and colon tumors tested 

As discussed above, the Examiner has not provided any evidence to challenge the 
reliability and significance of the data in Example 16 which reports that the PR0539 gene is 
amplified at least two-fold in a majority of lung and colon tumors tested compared to normal 
tissue. In contrast to this complete lack of evidence on the part of the Examiner, Appellants have 
submitted the Goddard Declaration. That declaration establishes that it is the opinion of an 
expert in the field who has personal knowledge of the facts surrounding Example 16 that a two- 
fold or greater amplification of a gene in tumors makes the gene useful as a diagnostic tool. The 
Examiner has not provided any evidence to challenge the facts and conclusions of the Goddard 
Declaration in support of Example 16. 

Given the disclosure of Example 16 and the supporting Goddard Declaration on the one 
hand, and the complete lack of any evidence or arguments on the other, it is clear that 
considering the evidence as a whole, one of skill in the art would conclude that it is more likely 
than not that the PR0539 gene is amplified at least two-fold in the majority of lung and colon 
tumors tested compared to normal tissue, and therefore the PROS 3 9 gene is useful as a 
diagnostic tool for lung and colon cancer. 

As Appellants explain below, it is more likely than not that the PR0539 mRNA and 
polypeptide are overexpressed in lung and colon tumors, and can therefore be used to distinguish 
tumor tissue from normal tissue. This provides utility for the claimed polypeptides related to the 
PR0539 polypeptide. 
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b. Appellants have established that generally there is a correlation between 
2ene amplification and mRNA overexvression 

Appellants turn next to the evidence offered in support of their assertion that it is well- 
established that gene amplification generally leads to overexpression of the corresponding 
mRNA. The teachings in Genes V, a leading textbook in the field, illustrate that at the time the 
instant application was filed, it was well known by those of skill in the art that gene amplification 
leads to overexpression of the corresponding gene product. See Benjamin Lewin, Genes V, 5 th ed. 
1994, pages 1196-1201. In a section entitled "Insertion, translocation, or amplification may 
activate proto-oncogenes", the text describes various molecular events that lead to 
overexpression of a gene product, using the c-myc gene as an example. The first mechanism 
taught is insertion of a retrovirus upstream of the gene which causes it to be driven by a more 
efficient promoter, resulting in increased mRNA and protein levels. Next, Lewin teaches that 
chromosomal translocations may bring a gene to a new region where it is actively expressed, 
resulting in increased gene and protein expression. The third mechanism whereby protein levels 
of oncogenes are overexpressed is gene amplification. The text emphasizes that the common 
thread among the different means of activation of proto-oncogenes is that the expression of the 
gene is increased . Thus, as of 1994, it was well-known in the art that gene amplification is 
correlated with overexpression of the corresponding mRNA and encoded protein. 

Additional information regarding the understanding of those of skill in the art regarding 
the relationship between gene amplification and protein overexpression at the time the instant 
application was filed is found in Alitalo (Med. Biol., (1984) 62:304-317), and Merlino et al (J. 
Clin. Invest, (1985) 75:1077-1079). Under the heading "Enhanced Expression of Amplified 
Oncogenes," Alitalo states that "[i]n all cases where they have been studied, the amplified 
oncogenes have been found abundantly expressed at the mRNA level roughly in proportion to 
the amount of DNA amplification (see Table 1)." Alitalo at 313 (emphasis added). Table 1 lists 
eleven examples of amplified oncogenes where expression levels were examined. In all eleven 
cases, expression of the amplified oncogene was elevated . Thus, Alitalo clearly teaches that 
amplification leads to overexpression. Merlino et al studied epidermoid carcinoma cells, and 
teach that amplification of the EGF receptor gene results in increased levels of EGF receptor 
mRNA and increased levels of EGF receptor protein. Taken together, the excerpt from Genes V, 
as well as the Alitalo and Merlino references, establish that as of the filing date of the instant 
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application, those of skill in the art appreciated the correlation between gene amplification and 
overexpression of the encoded gene product. 

The teachings of Genes V, Alitalo, and Merlino are confirmed in several more recent 
reports that also document the correlation between gene amplification and levels of protein. 
Appellants have submitted two more recent studies providing evidence that the teachings 
referred to above are still widely accepted by those of skill in the art. Orntoft et al (Molecular 
and Cellular Proteomics, (2002) 1:37-45), studied transcript levels of 5600 genes in malignant 
bladder cancers which were linked to a gain/loss of chromosomal material using an array-based 
method. Orntoft et al showed that there was a gene dosage effect and teach that "in general (18 
of 23 cases) chromosomal areas with more than 2-fold gain of DNA showed a corresponding 
increase in mRNA transcripts." Orntoft at 37, column 1, abstract. In addition, Hyman et al 
(Cancer Research, (2002) 62:6240-6245) used CGH analysis and cDNA microarrays to compare 
DNA copy numbers and mRNA expression of over 12,000 genes in breast cancer tumors and cell 
lines. They showed that there is "evidence of a prominent global influence of copy number 
changes on gene expression levels." Hyman at 6244, column 1, last paragraph. 

Additional supportive teachings are also provided by Pollack et al (PNAS, (2002) 
99:12963-12968) who studied a series of primary human breast tumors and found that "[b]y 
analyzing mRNA levels in parallel, we have also discovered that changes in DNA copy number 
have a large, pervasive, direct effect on global gene expression patterns in both breast cancer cell 
lines and tumors." Pollack at 12967 at column 1, (emphasis added). Their study found that 
"62% of highly amplified genes show moderately or highly elevated expression, that DNA copy 
number influences gene expression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, a 2-fold change in DNA 
copy number is associated with a corresponding 1.5-fold change in mRNA levels." Pollack at 
12963, column 1, abstract. 

Appellants have also submitted two references which show that gene amplification leads 
to overexpression of the encoded protein. Bahnassy et al (BMC Gastroenterology, (2004) 4:22- 
34), studied the amplification of cyclin Dl and assessed the levels of the encoded protein by 
immunohistochemistry. Bahnassy et al found a "significant correlation between cyclin Dl gene 
amplification and protein overexpression." Bahnassy at 27, column 1. Similarly, Blancato et al 
(British Journal of Cancer, (2004) 90(8), 1612-1619), report that overexpression of c-myc mRNA 

-30- 



Appl. No. : 10/032,996 

Filed : December 27, 2001 



and c-Myc protein is related to the copy number of the c-myc amplification. Blancato at 1613, 
column 2. Bahnassy and Blancato demonstrate continued evidentiary support for the widely- 
accepted principle that gene amplification correlates with overexpression of the encoded protein. 
Presumably, amplification leads to overexpression of the encoded protein by increasing the 
amount of corresponding mRNA. 

Together, these excerpts and articles collectively teach that it is more likely than not that 
gene amplification increases mRNA expression. This evidence establishes that there is a 
reasonable correlation between gene amplification and gene overexpression, and one of skill in 
the art would believe, to a reasonable probability, that gene amplification would lead to increased 
gene expression. 

This conclusion is supported by the declaration of Victoria Smith, Ph.D., an expert in the 
field of Molecular Biology. Exhibit B of her Declaration reports the results of microarray 
analysis conducted on the cDNA of the gene encoding PR0539 (DNA47465). The results 
indicate that the PR0539 mRNA is significantly overexpressed in eight of the twenty-six lung 
tumor samples tested compared to the normal lung tissue controls. That is the equivalent of 
nearly one in every three samples (31%). In addition, four out of five squamous cell lung 
carcinomas (80%) are significantly overexpressed (shown in bold). In contrast, only one of the 
seven individual normal lung tissue samples shows significant overexpression of the PR0539 
gene (14%). The tumors analyzed in the microarray are not the same tumors reported in 
Example 16, and therefore it is not possible to directly relate gene amplification to mRNA 
overexpression. However, these data do support the conclusion that PR0539 mRNA is 
overexpressed in lung tumors, particularly squamous cell lung carcinomas. 

Dr. Smith states that "[i]t is well-established in the art that overexpression of the mRNA 
for a gene is likely to lead to overexpression of the corresponding protein." Smith Declaration at 
1 6. She explains: 

Given the known correlation between overexpression of a gene and the 
corresponding overexpression of the encoded protein, it is very likely that a 
similar number of lung tumors will overexpress the PR0539 protein, while very 
few normal lung tissue samples likely will. Together with the data reported in 
Example 16 that the gene encoding PR0539 is amplified in some lung tumors, 
including squamous cell lung carcinoma, the results reported in Exhibit B indicate 
that the PR0539 gene and protein, as well as antibodies to the encoded protein, 
can be used to differentiate some cancerous lung tissue, particularly squamous 
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cell carcinoma, from normal lung tissue. Smith Declaration at ^ 7 (emphasis in 
original). 

Because not all lung tumors show overexpression of PR0539, it cannot be used to 
exclude a sample being tested as non-cancerous. However, Dr. Smith states that the PR0539 
gene, protein, and corresponding antibodies are useful as a diagnostic tool for lung cancer, 
particularly squamous cell carcinoma, since a very high percentage of squamous cell lung 
carcinomas overexpress the gene and most likely the encoded protein, while very few normal 
lung samples do. 

As discussed above, the Examiner relies on a single apparently contrary example of one 
gene reported by Pennica et al. to support the assertion that there is "no relationship between 
gene amplification and mRNA levels, since mRNA levels have no necessary correlation with 
gene amplification." Office Action at 4 (emphasis added). As Appellants discussed above, this 
possible contrary example is balanced by the report in Pennica that the amplification of the 
WISP-1 gene by about two-fold led to overexpression of the corresponding mRNA. Thus, 
Pennica is at best inconclusive with one example of a positive correlation and one possible 
contrary example. If the example of WISP-2 relied on by the Examiner is an artifact as the 
authors suggest is possible, the Pennica reference actually supports Appellants' position. 

As stated above, the standard for utility is not absolute or even statistical certainty, but 
rather whether considering the evidence as a whole, one of skill in the art would be more likely 
than not to believe the asserted utility. Even if Pennica supported the PTO's argument, which it 
does not, one contrary example is not sufficient to prove that a person of skill in the art would 
have a reasonable doubt that gene amplification is not correlated to gene expression. Given the 
evidence provided by the Appellants which establishes that there is a correlation between gene 
amplification and mRNA expression, one of skill in the art would believe, to a reasonable 
probability, that the reported amplification of the PR0539 gene would lead to an increase in the 
level 0fPRO539mRNA. 
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c. Appellants have established that generally there is a correlation between 
changes in mRNA expression levels and changes in the expression level 
of the encoded protein 

Appellants turn to the next portion of their argument in support of their asserted utility - 
that it is well-established in the art that in most cases a change in the level of mRNA for a 
particular protein leads to a corresponding change in the level of the encoded protein. Given 
Appellants' evidence of two-fold amplification and overexpression of the PR0539 gene in 
certain lung and colon tumors, it is more likely than not that the PR0539 polypeptide is likewise 
overexpressed, and therefore the claimed polypeptides are useful as diagnostic tools, particularly 
for lung and colon tumor. 

In support of the assertion that changes in mRNA are positively correlated to changes in 
protein levels, Appellants submitted a Declaration by J. Christopher Grimaldi, an expert in the 
field of cancer biology. As stated in paragraph 5 of the declaration, "Those who work in this 
field are well aware that in the vast majority of cases, when a gene is over-expressed. . .the gene 
product or polypeptide will also be over-expressed.... This same principal applies to gene 
under-expression." Grimaldi Declaration at 5. Further, "increased mRNA expression is 
expected to result in increased polypeptide expression, and the detection of decreased mRNA 
expression is expected to result in decreased polypeptide expression." Id. 

Appellants also submitted the declaration of Paul Polakis, Ph.D. an expert in the field of 

cancer biology. As stated in paragraph 6 of his declaration: 

Based on my own experience accumulated in more than 20 years of research, 
including the data discussed in paragraphs 4 and 5 above [showing a positive 
correlation between mRNA levels and encoded protein levels in the vast majority 
of cases studied in relation to the present invention] and my knowledge of the 
relevant scientific literature, it is my considered scientific opinion that for human 
genes, an increased level of mRNA in a tumor cell relative to a normal cell 
typically correlates to a similar increase in abundance of the encoded protein in 
the tumor cell relative to the normal cell. In fact, it remains a central dogma in 
molecular biology that increased mRNA levels are predictive of corresponding 
increased levels of the encoded protein, Polakis Declaration at If 6 (emphasis 
added). 

Dr. Polakis acknowledges that there are published cases where such a correlation does not exist, 
but states that it is his opinion, based on over 20 years of scientific research, that "such reports 
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are exceptions to the commonly understood general rule that increased mRNA levels are 
predictive of corresponding increased levels of the encoded protein." Polakis Declaration at \ 6. 

As mentioned above, the Smith Declaration also states that "[i]t is well-established in the 
art that overexpression of the mRNA for a gene is likely to lead to overexpression of the 
corresponding protein." Smith Declaration at If 6. She explains that "[g]iven the known 
correlation between overexpression of a gene and the corresponding overexpression of the 
encoded protein" it is very likely that those lung tumors that overexpress the PR0539 mRNA 
will also overexpress the PR0539 protein, and this indicates "that the PR0539 gene and protein, 
as well as antibodies to the encoded protein, can be used to differentiate some cancerous lung 
tissue, particularly squamous cell carcinoma, from normal lung tissue." Smith Declaration at 7. 

The Declarations of Grimaldi, Polakis and Smith are supported by the teachings in 
Molecular Biology of the Cell, a leading textbook in the field (Bruce Alberts, et al., Molecular 
Biology of the Cell (3 rd ed. 1994) (hereinafter "Cell 3 rd ") and (4 th ed. 2002) (hereinafter "Cell 
4 th ")). Figure 9-2 of Cell 3 rd shows the steps at which eukaryotic gene expression can be 
controlled. The first step depicted is transcriptional control. Cell 3 rd provides that "[f]or most 
genes transcriptional controls are paramount. This makes sense because, of all the possible 
control points illustrated in Figure 9-2, only transcriptional control ensures that no superfluous 
intermediates are synthesized." Cell 3 rd at 403 (emphasis added). In addition, the text states that 
"Although controls on the initiation of gene transcription are the predominant form of regulation 
for most genes , other controls can act later in the pathway from RNA to protein to modulate the 
amount of gene product that is made." Cell 3 rd at 453 (emphasis added). Thus, as established in 
Cell 3 rd , the predominant mechanism for regulating the amount of protein produced is by 
regulating transcription. 

In Cell 4 th , Figure 6-3 on page 302 illustrates the basic principle that there is a correlation 
between increased gene expression and increased protein expression. The accompanying text 
states that "a cell can change (or regulate) the expression of each of its genes according to the 
needs of the moment - most obviously by controlling the production of its mRNA ." Cell 4 th at 
302 (emphasis added). Similarly, Figure 6-90 on page 364 of Cell 4 th illustrates the path from 
gene to protein. The accompanying text states that while potentially each step can be regulated 
by the cell, " the initiation of transcription is the most common point for a cell to regulate the 
expression of each of its genes ." Cell 4 th at 364 (emphasis added). This point is repeated on 
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page 379, where the authors state that of all the possible points for regulating protein expression, 
" [f]or most genes transcriptional controls are paramount ," Cell 4 at 379 (emphasis added). 

Further support for Appellants' position can be found in the textbook, Genes VI, 
(Benjamin Lewin, Genes VI (1997)) which states "having acknowledged that control of gene 
expression can occur at multiple stages, and that production of RNA cannot inevitably be 
equated with production of protein, it is clear that the overwhelming majority of regulatory 
events occur at the initiation of transcription ." Genes VI dX 847-848 (emphasis added). 

Additional support is also found in Zhigang et ai y World Journal of Surgical Oncology 
2:13, 2004. Zhigang studied the expression of prostate stem cell antigen (PSCA) protein and 
mRNA to validate it as a potential molecular target for diagnosis and treatment of human 
prostate cancer. The data showed "a high degree of correlation between PSCA protein and 
mRNA expression" Zhigang at 4. Of the samples tested, 81 out of 87 showed a high degree of 
correlation between mRNA expression and protein expression. The authors conclude that "it is 
demonstrated that PSCA protein and mRNA overexpressed in human prostate cancer, and that 
the increased protein level of PSCA was resulted from the upregulated transcription of its 
mRNA." Id. at 6. Even though the correlation between mRNA expression and protein 
expression occurred in 93% of the samples tested, not 100%, the authors state that "PSCA may 
be a promising molecular marker for the clinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of this tumor." Id. at 7. 

Further, as noted above, Meric et al., Molecular Cancer Therapeutics, vol. 1, 971-979 

(2002), states the following: 

The fundamental principle of molecular therapeutics in cancer is to exploit the 
differences in gene expression between cancer cells and normal cells... [M]ost 
efforts have concentrated on identifying differences in gene expression . at the 
level of mRNA, which can be attributable to either DNA amplification or to 
differences in transcription. Meric et al. at 971 (emphasis added). 

Those of skill in the art would not be focusing on differences in gene expression between cancer 
cells and normal cells if there were no correlation between gene expression and protein 
expression. 

Together, the declarations of Grimaldi, Polakis, and Smith, the accompanying references, 
and the excerpts and references discussed above all establish that the accepted understanding in 
the art is that there is a reasonable correlation between changes in gene expression and changes 
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in the level of the encoded protein. In contrast to this substantial amount of evidence supporting 
Appellants' position, the Examiner has two references, Meric et al and Gokman-Polar et al. 
However, as discussed above, Meric and Gokman-Polar actually support Appellants' position, 
not the Examiner's. It is clear that when considered as a whole, the preponderance of the 
evidence clearly weighs in favor of Appellants. 

d. The Examiner has failed to consider all of the evidence of record as a 
whole 

Even assuming that the Examiner has established a prima facie case that one of skill in 
the art would doubt the truth of the asserted utility, Appellants have presented sufficient rebuttal 
evidence. The rebuttal evidence "will be sufficient if, considered as a whole , it leads a person of 
ordinary skill in the art to conclude that the asserted utility is more likely than not true . M.P.E.P. 
§ 2107.02, part VII (emphasis in original, citations omitted). 

The Examiner has not considered the evidence as a whole. Instead, the Examiner has 
summarily stated that the Grimaldi and Polakis Declarations are "fundamentally flawed because 
they fail to provide specific evidence regarding Pro-539." Office Action mailed 911 104 at 14. In 
response to the Declaration of Dr. Smith which has specific evidence regarding PR0539, the 
Examiner is silent, completely ignoring her declaration. In response to the extensive references 
cited by the Appellants, the Examiner has summarily dismissed them as "nonspecific arguments" 
that do not relate to PR0539. Ojfi& Action at 19. He argues that "[n]one of the references 
demonstrate that there is a 'reasonable probability' that the Pro-539 protein is overexpressed or 
that antibodies to the Pro-539 protein itself have any utility." Id. Instead of weighing the 
evidence as a whole as required, the Examiner summarily states that "[a]s noted in the rejection, 
there are other articles which demonstrate that there is no necessary relationship [between 
nucleic acid overexpression and protein overexpression] for every protein." Id. (emphasis 
added). 

Appellants have established that the PR0539 gene is amplified at least two-fold in the 
majority of lung and colon tumors tested. Appellants have submitted substantial evidence in the 
form of additional data, expert declarations, reference articles and textbooks, to support their 
assertion that it is more likely than not that this gene amplification leads to overexpression of the 
PR0539 mRNA and polypeptide. This makes the claimed polypeptides related to PR0539 
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polypeptide useful for diagnosing cancer, particularly lung and colon tumors. The Examiner has 
provided almost no evidence to the contrary. Given the overwhelming amount of evidence in 
support of Appellants' position, and the near absence of any evidence in support of the 
Examiner's position, when considered as a whole the evidence leads a person of ordinary skill in 
the art to conclude that the asserted utility is more likely than not true. 

8. The Courts have held that the Utility Requirement was Satisfied in Similar 
Cases 

The seminal decision interpreting the utility requirement of 35 U.S.C. § 101 is Brenner v. 
Manson, 383 U.S. 519, 148 U.S.P.Q. 689 (1966). At issue in Brenner was a claim to "a 
chemical process which yields an already known product whose utility - other than as a possible 
object of scientific inquiry - ha[d] not yet been evidenced." Id. at 529, 148 U.S.P.Q. at 693. The 
Patent Office rejected the claimed process for lack of utility because the product produced by the 
claimed process had no known use. See id. at 521-22, 148 U.S.P.Q. at 690. On appeal, the Court 
of Customs and Patent Appeals reversed, holding "where a claimed process produces a known 
product it is not necessary to show utility for the product." Id. at 522, 148 U.S.P.Q. at 691 . 

In reviewing the lower court's decision, the Court made its oft quoted statement that 
u [t]he basic quid pro quo contemplated by the Constitution and the Congress for granting a 
patent monopoly is the benefit derived by the public from an invention with substantial utility. 
Unless and until a process is refined and developed to this point - where specific benefit exists in 
currently available form - there is insufficient justification for permitting an applicant to engross 
what may prove to be a broad field." Id. at 534-35, 148 U.S.P.Q. at 695. 

The first opinion of the C.C.P.A. applying Brenner was In re Kirk, 376 F.2d 936, 153 
U.S.P.Q. 48 (C.C.P.A. 1967). The invention claimed in Kirk was a set of steroid derivatives said 
to have valuable biological properties and to be of value "in the furtherance of steroidal research 
and in the application of steroidal materials to veterinary or medical practice." Id. at 938, 153 
U.S.P.Q. at 50. In affirming the claim rejection based on a lack of utility, the court held that the 
"nebulous expressions 'biological activity' or 'biological properties'" did not adequately convey 
how to use the claimed compounds. Id. at 941, 153 U.S.P.Q. at 52. The court also rejected 
appellants' supporting affidavit, stating, "the sum and substance of the affidavit appears to be 
that one of ordinary skill in the art would know 'how to use' the compounds to find out in the 
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first instance whether the compounds are - or are not - in fact useful or possess useful properties, 
and to ascertain what those properties are." Id. at 942, 153 U.S.P.Q. at 53. 

The instant case is clearly distinguishable from Kirk. In Kirk, the asserted utility for the 
claimed compounds was "a new class of compounds often possessing high biological activity" 
and "intermediates in the preparation of compounds with valuable biological properties...". Id. 
at 1120, 1121. Here, Appellants have asserted a much more specific utility than "biological 
activity" or "biological properties." Appellants have provided evidence of amplification and 
overexpression of the PR0539 gene in certain cancers and have shown that this evidence is 
reasonably correlated to overexpression of the PR0539 polypeptide in those same cancers, 
namely, lung and colon cancer. As Example 12 of the Utility Guidelines make clear, when a 
protein is differentially expressed in cancer compared to normal tissue, the protein and antibodies 
have utility in diagnosing the cancer. 

Since the Brenner and Kirk decisions, the courts have continued to clarify what is 
sufficient to satisfy the utility requirement. Three more recent decisions are of particular 
relevance to the instant application: Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. 881 (C.C.P.A. 
1980), Cross v. Iizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 (Fed. Cir. 1985), and Fujikawa v. 
Wattanasin, 93 F.3d 1559, 39 U.S.P.Q. 2d 1895 (Fed. Cir. 1996). 

The earliest of these cases, Nelson v. Bowler, involved an interference between two 
applications related to derivatives of naturally occurring prostaglandins (PG). Nelson, 626 F.2d 
at 854-55. The issue was whether Nelson had shown at least one utility for the compounds at 
issue to establish an actual reduction to practice. Id. at 855. The Appellants relied on two tests 
to prove practical utility: an in vivo rat blood pressure (BP) test and an in vitro gerbil colon 
smooth muscle stimulation (GC-SMS) test. In the BP test, the blood pressure of anesthetized 
rats was recorded on a polygraph chart to determine whether an injected compound had any 
effect. Responses were categorized as either a depressor (lowering) effect or a pressor 
(elevating) effect. Id. In the GC-SMS test a section of colon was excised from a freshly-killed 
gerbil for suspension in a physiological solution, and a lever arm was connected to the colon in 
such a way that any contraction was recorded as a polygraph trace. Id. The Board held that 
Nelson had not shown adequate proof of practical utility, characterizing the tests as "rough 
screens, uncorrected with actual utility." Id. at 856. 
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On appeal the C.C.P.A. reversed, holding that the Board "erred in not recognizing that 

tests evidencing pharmacological activity may manifest a practical utility even though they may 

not establish a specific therapeutic use." Id. The Court stated that "practical utility" was 

characterized as a use of the claimed discovery in a manner which provides some immediate 

benefit to the public, establishing the following rule: 

Knowledge of the pharmacological activity of any compound is obviously 
beneficial to the public. It is inherently faster and easier to combat illnesses and 
alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities. Since it is crucial to provide 
researchers with an incentive to disclose pharmacological activities in as many 
compounds as possible, we conclude that adequate proof of any such activity 
constitutes a showing of practical utility. Id. (emphasis added). 

The Court rejected Bowler's argument that the BP and GC-SMS tests are inconclusive 
showings of pharmacological activity since confirmation by statistically significant means did 
not occur until after the critical date. The Court stated that "a rigorous correlation is not 
necessary where the test for pharmacological activity is reasonably indicative of the desired 
response." Id. (emphasis added). The Court concluded that a " reasonable correlation " between 
the observed properties and the suggested use was sufficient to establish practical utility. Id. at 
857. 

The sufficiency of a "reasonable correlation" in establishing utility was affirmed by the 
Court of Appeals for the Federal Circuit in Cross v. Iizuka, 753 F.2d 1040, 224 U.S.P.Q. 739 
(Fed. Cir. 1985). In Cross, the subject of the interference before the Court was imidazole 
derivative compounds which inhibit the synthesis of thromboxane synthetase, an enzyme which 
leads to the formation of thromboxane A2. At the time the applications were filed, 
thromboxane A2 was postulated to be involved in platelet aggregation, which was associated 
with several deleterious conditions. Id. at 1042. 

The question before the Board and reviewed by the Court was whether Iizuka was 
entitled to the benefit of his Japanese priority application. Id. The Japanese application 
disclosed that the imidazole derivatives showed strong inhibitory action for thromboxane 
synthetase from human or bovine platelet microsomes, an in vitro utility. Id. at 1043. Relying in 
part on Nelson, the Board held that tests evidencing pharmacological activity may manifest a 
practical utility even though they may not establish a specific therapeutic use, and concluded that 
the in vitro tests were sufficient to establish a practical utility. Id. 

-39- 



Appl. No. : 10/032,996 

Filed : December 27, 2001 



On appeal, Cross argued that the basic in vitro tests conducted in cellular fractions did not 
establish a practical utility for the claimed compounds, and that more sophisticated in vitro or in 
vivo tests were necessary to establish a practical utility. Id. at 1050. The Court rejected this 
argument, noting that adequate proof of any pharmaceutical activity constitutes a showing of 
practical utility. Id. The Court accepted the argument that initial testing of compounds is widely 
done in vitro: 

[I\n vitro results... are generally predictive of in vivo test results, i.e., there is a 
reasonable correlation therebetween. Were this not so, the testing procedures of 
the pharmaceutical industry would not be as they are. Iizuka has not urged, and 
rightly so, that there is an invariable exact correlation between in vitro test results 
and in vivo test results. Rather, Iizuka' s position is that successful in vitro testing 
for a particular pharmacological activity establishes a significant probability that 
in vivo testing for this particular pharmacological activity will be successful. Id. 
(emphasis added). 

The Court also noted that in previous decisions, its predecessor court had accepted 

evidence of in vivo utility as sufficient to establish practical utility. The Court reasoned that: 

This in vivo testing is but an intermediate link in a screening chain which may 
eventually lead to the use of the drug as a therapeutic agent in humans. We 
perceive no insurmountable difficulty, under appropriate circumstances, in finding 
that the first link in the screening chain, in vitro testing, may establish a practical 
utility for the compound in question. Successful in vitro testing will marshal 
resources and direct the expenditure of effort to further in vivo testing of the most 
potent compounds, thereby providing an immediate benefit to the public, 
analogous to the benefit provided by the showing of an in vivo utility . Id. at 1051, 
citing Nelson, 626 F.2d at 856 (emphasis added). 

Based on this reasoning, the Court affirmed the decision of the Board, stating that "based 
upon the relevant evidence as a whole, there is a reasonable correlation between the disclosed in 
vitro utility and an in vivo activity, and therefore a rigorous correlation is not necessary where 
the disclosure of pharmacological activity is reasonable based upon the probative evidence." Id. 
at 1050 (emphasis added). The Court therefore held that the disclosed in vitro utility was 
"sufficient to comply with the practical utility requirement of § 101." Id. at 1051. 

The holdings of Nelson and Cross were more recently affirmed in Fujikawa v. 

Wattanasin, 93 F.3d 1559, 39 U.S.P.Q.2d 1895 (Fed. Cir. 1996). In Fujikawa, the Court again 

affirmed the notion that initial screens of compounds provide a practical utility even though they 

may not provide a therapeutic use because "'[i]t is inherently faster and easier to combat 

illnesses and alleviate symptoms when the medical profession is armed with an arsenal of 
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chemicals having known pharmacological activities.'" Id. at 1564, quoting Nelson, 626 F.2d at 

856. The Court noted that it may be difficult to predict whether novel compounds will exhibit 

pharmacological activity, and consequently testing is often required to establish practical utility. 

Id. However the Court went on to state: 

But the test results need not absolutely prove that the compound is 
pharmacologically active. All that is required is that the tests be "reasonably 
indicative of the desired [pharmacological] response." In other words, there must 
be a sufficient correlation between the tests and an asserted pharmacological 
activity so as to convince those skilled in the art, to a reasonable probability , that 
the novel compound will exhibit the asserted pharmacological behavior." Id. 
(internal citations omitted, underline emphasis added, italics in original). 

On appeal, Fujikawa argued that Wattanasin had failed to establish an adequate 
correlation between the in vitro and in vivo results to permit Wattanasin to rely on positive in 
vitro results to establish a practical utility. The Court stated that the Board relied on testimony 
from those skilled in the art that the in vitro results convinced the experts that the claimed 
compounds would exhibit the desired pharmacological activity when administered in vivo, 
including testimony that in vivo activity is typically highly correctable to a compound's in vitro 
activity in the field. Id. at 1565. To overcome this evidence and counter the Board's decision, 
Fujikawa pointed to the testimony of its expert that "there is a reasonable element of doubt that 
some elements may be encountered which are active in the in vitro assay, but yet inactive in the 
in vivo assay." Id. 

The Court rejected this argument: "Of course, it is possible that some compounds active 
in vitro may not be active in vivo. But, as our predecessor court in Nelson explained, a 'rigorous 
correlation' need not be shown in order to establish practical utility: 'reasonable correlation' 
suffices ." Id. (emphasis added). The Court also rejected Fujikawa's reliance on two articles. 
The Court noted that while one article taught that "in vitro testing is sometimes not a good 
indicator of how potent a compound will be in vivo f it does imply that compounds which are 
active in vitro will normally exhibit some in vivo activity." Id. at 1566. Similarly, the Court 
noted that the second article expressly stated that "[f]or most substances, although not for all, the 
relative potency determined in in vitro . . . parallels the in vivo activity." Id. 

The Court concluded that the facts in the case were analogous to the ones in Cross where 
the court relied on a known reasonable correlation between in vitro tests and in vivo activity, and 
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therefore affirmed the Board's decision that Wattanasin had established a practical utility with 
the in vitro results. Id. at 1565-66. 

The Nelson, Cross, and Fujikawa cases are very similar to the present case. The 
reasoning of the courts in all three cases that "'[i]t is inherently faster and easier to combat 
illnesses and alleviate symptoms when the medical profession is armed with an arsenal of 
chemicals having known pharmacological activities'" applies to the asserted utility for the 
claimed polypeptides. Fujikawa, 93 F.3d at 1564, quoting Nelson, 626 F.2d at 856; see also 
Cross, 753 F.2d at 1051 ("Successful in vitro testing will marshal resources and direct the 
expenditure of effort to further in vivo testing of the most potent compounds, thereby providing 
an immediate benefit to the public, analogous to the benefit provided by the showing of an in 
vivo utility."). Like pharmaceutical compounds, nucleic acids, polypeptides, and antibodies 
which are associated with cancer will make it inherently faster and easier to combat cancer. The 
greater the number of biological markers of cancer medical professionals have access to, the 
more accurate and detailed a diagnosis they can make. The determination that a gene is 
amplified and differentially expressed in cancer compared to normal tissue constitutes at least as 
significant a development in the field of cancer diagnostics as in vitro screening for 
pharmaceutical activity. See Cross, 753 F.2d at 1051 ("the first link in the screening chain, in 
vitro testing, may establish a practical utility for the compound in question. Successful in vitro 
testing will marshal resources and direct the expenditure of effort to further in vivo testing of the 
most potent compounds, thereby providing an immediate benefit to the public"). 

In addition, like in vitro tests in the pharmaceutical industry, those of skill in the field of 
biotechnology rely on the reasonable correlation that exists between gene amplification and 
mRNA and protein expression (see discussion supra). Were there no reasonable correlation 
between them, the techniques that measure gene levels such as microarray analysis, differential 
display, and quantitative PCR would not be so widely used by those in the art. See Grimaldi 
Declaration at %5. As in Cross, Appellants here do not argue that there is "an invariable exact 
correlation" between gene amplification, mRNA, and protein expression. See Cross, 753 F.2d at 
1050. Instead, Appellants' position detailed above is that a measured change in gene 
amplification and gene expression in cancer cells establishes a "significant probability" that the 
expression of the encoded polypeptide in cancer will also be changed based on "a reasonable 
correlation therebetween." Id.\ see also Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' 
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need not be shown in order to establish practical utility; 'reasonable correlation' suffices"); 
Nelson, 626 F.2d at 857 (holding that "a rigorous correlation is not necessary" and that a 
"reasonable correlation" will suffice). 

Also of importance is the Court's rejection of the notion that any in vitro testing must be 
statistically significant to support a practical utility. Nelson, 626 F.2d at 857. Likewise, 
qualitative characterizations of a test compound as either increasing or decreasing blood pressure 
was acceptable. Id. at 855 (stating that responses were categorized as either a depressor 
(lowering) effect or a pressor (elevating) effect). This is similar to the data in Example 16, 
where the gene amplification was not evaluated for statistical significance, although Dr. 
Goddard's unchallenged declaration makes clear that the data are reliable and significant. 

There are additional similarities. In Fujikawa, the Board and Court rejected the argument 
that there was no utility because there was no exact correlation between the in vitro and in vivo 
results in spite of testimony and references establishing a lack of a "necessary" correlation. 
Fujikawa, 93 F.3d at 1565-66. Like the two references rejected by the Board and Court in 
Fujikawa, the references cited by the Examiner may suggest that the correlation between gene 
amplification and increases in mRNA levels and protein levels is not exact. But like Fujikawa, 
portions of the references also support Appellants' assertion, and Appellants have submitted the 
declaration of two experts in the field which state that those in the field rely on the correlation 
between changes in mRNA and protein. See Grimaldi Declaration at f 5; Polakis Declaration at 
1} 6. Thus, as was the case in Fujikawa, although there may be some evidence that the correlation 
relied on is not exact, the declarations and numerous references submitted by Appellants are 
more than enough evidence to establish that there is a "reasonable correlation" between gene 
amplification, changes in mRNA levels and changes in the level of the encoded protein. 

In conclusion, Appellants have asserted that the claimed polypeptides are useful for the 
diagnosis of cancer, particularly lung and colon cancer based on the data in Example 16. This 
utility is far beyond the nebulous expressions "biological activity" or "biological properties" 
rejected in In re Kirk, 376 F.2d 936, 153 U.S.P.Q. 48 (C.C.P.A. 1967). Like Nelson, Cross, and 
Fujikawa, Appellants have asserted a utility which relies on a reasonable correlation between the 
data disclosed in the application and the asserted utility. The fact that there may be limited 
evidence that the correlation is not exact does not invalidate Appellants' showing of utility since 
the correlation need not be a rigorous or exact one. Considering the relevant evidence as a whole, 
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Appellants have provided sufficient evidence to establish a reasonable correlation between gene 
amplification, changes in the level of mRNA and corresponding changes in the level of the 
encoded polypeptide. Therefore the claimed polypeptides have a practical utility as diagnostic 
tools for lung and colon cancer. 

9. Utility - Conclusion 

Appellants' asserted utility for the claimed polypeptides as diagnostic tools for cancer 
corresponds in scope to the subject matter sought to be patented and therefore "must be taken as 
sufficient to satisfy the utility requirement of § 101 for the entire claimed subject." In re hanger, 
503 F.2d 1380, 1391, 183 U.S.P.Q. 288, 297 (C.C.P.A. 1974). The Examiner's unsupported 
arguments and references are not sufficient evidence to make a prima facie showing that "one of 
ordinary skill in the art would reasonably doubt the asserted utility." In re Brana, 51 F.3d 1560, 
1566, 34 U.S.P.Q.2d 1436 (Fed. Cir. 1995). And even if the Examiner has established a prima 
facie case, Appellants have offered sufficient rebuttal evidence in the form of expert declarations 
and references, which, when considered as a whole, establish that it is more likely than not that 
the asserted utility is true. See In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 
(Fed. Cir. 1992) (stating that the evidentiary standard to be used throughout ex parte examination 
in setting forth a rejection is a preponderance of the evidence, or "more likely than not" 
standard); M.P.E.P. § 2107.02, part VII ("evidence will be sufficient if, considered as a whole, it 
leads a person of ordinary skill in the art to conclude that the asserted utility is more likely than 
not true .") (emphasis in original). 

Finally, the courts' decisions in similar cases make clear that the evidence provided by 
Appellants is sufficient to establish the asserted utility. The evidence does not need to be direct 
evidence, nor does it need to provide an exact correlation between the submitted evidence and 
the asserted utility. Instead, evidence which is "reasonably" correlated with the asserted utility is 
sufficient. See Fujikawa, 93 F.3d at 1565 ("a 'rigorous correlation' need not be shown in order 
to establish practical utility; 'reasonable correlation' suffices"); Cross, 753 F.2d at 1050 (same); 
Nelson, 626 F.2d at 857 (same). Considering the evidence as a whole in light of the relevant 
cases, the Board should find that Appellants have established at least one specific, substantial, 
and credible utility, and the Examiner's rejection of the pending claims as lacking utility should 
be reversed. 
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C. Written Description Rejection - Detailed Arguments 

The second issue before the Board is whether the claimed subject matter is described in 
the specification in such a way as to reasonably convey to one skilled in the art that the inventors 
had possession of the claimed invention at the time the application was filed. The Examiner has 
rejected Claims 48-54 under 35 U.S.C. §112, first paragraph, as lacking an adequate written 
description, stating that "there is no record of description which would demonstrate conception 
of any proteins other than those expressly disclosed which comprise SEQ ID NO:7." Office 
Action at 7. 

The Examiner has failed to meet his initial burden of rebutting the presumption that the 
written description is adequate because nowhere in the Final Office Action does the Examiner 
address his arguments to Claims 48-54, and the arguments he has made are either flawed, or do 
not apply to Claims 48-54. For the reasons detailed below, Appellants submit that Claims 48-54 
are adequately described such that one of skill in the art would recognize that the inventors had 
possession of the claimed invention at the time the application was filed. 

1. Written Description - Legal Standard 

The well-established test for sufficiency of support under the written description 

requirement of 35 U.S.C. §112, first paragraph is stated by the Court in Vas-Cath, Inc. v. 

Mahurkar, 935 F.2d 1555, 19 U.S.P.Q.2d 1111 (Fed. Cir. 1991): 

"Although [the applicant] does not have to describe exactly the subject matter 
claimed, ... the description must clearly allow persons of ordinary skill in the art 
to recognize that [he or she] invented what is claimed." "The test for sufficiency 
of support in a parent application is whether the disclosure of the application 
relied upon 'reasonably conveys to the artisan that the inventor had possession at 
that time of the later claimed subject matter.'" Vas-Cath, Inc. v. Mahurkar, 935 
F.2d at 1562-63, 19 U.S.P.Q.2d at 1 1 16 (citations omitted) 

2. Written Description - Burden of Proof 

The M.P.E.P. states that "[a] description as filed is presumed to be adequate , unless or 

until sufficient evidence or reasoning to the contrary has been presented by the examiner to rebut 

the presumption. See, e.g., In re Marzocchi, 439 F.2d 220, 224, 169 U.S.P.Q. 367, 370 (C.C.P.A. 

1971)." M.P.E.P. § 2163.04 (emphasis added). Therefore "[t]he examiner has the initial burden 

of presenting by a preponderance of evidence why a person skilled in the art would not recognize 
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in an applicant's disclosure a description of the invention defined by the claims. Wertheim, 541 
F.2d at 263, 191 U.S.P.Q. at 97." Id. Only then does the Applicant need to respond to the 
Examiner's arguments. 

3. Written Description - Standard of Proof 

The adequacy of written description support is a factual issue and is to be determined on 
a case-by-case basis. See e.g., Vas-Cath, Inc. v. Mahurkar, 935 F.2d at 1563, 19 U.S.P.Q.2d at 
1116 (Fed. Cir. 1991) (emphasis added). The factual determination in a written description 
analysis depends on the nature of the invention and the amount of knowledge imparted to those 
skilled in the art by the disclosure. Union Oil v. Atlantic Richfield Co., 208 F.3d 989, 996 (Fed. 
Cir. 2000). As with the utility requirement, the evidentiary standard to be used throughout ex 
parte examination in setting forth a rejection is a preponderance of the evidence, or " more likely 
than not " standard. In re Oetiker, 911 F.2d 1443, 1445, 24 U.S.P.Q.2d 1443, 1444 (Fed. Cir. 
1992). 

4. The Examiner's Arguments 

To overcome the presumption that the claimed subject matter is adequately described, the 
Examiner must present "evidence why a person skilled in the art would not recognize in an 
applicant's disclosure a description of the invention defined by the claims. Wertheim, 541 F.2d 
at 263, 191 U.S.P.Q. at 97." M.P.E.P. § 2163.04. The Examiner has made essentially three 
arguments in an attempt to rebut this presumption. 

a. The rejected claims allegedly encompasses a large genus for which no 
written description is provided 
The Examiner has asserted that that the claims "are open to a protein that ranges from 
95% to 99% identical to SEQ ID NO: 7, without any guidance on conserved portions of the 
protein structure." The Examiner states that "[t]his large genus is represented in the specification 
by only the particularly named SEQ ID NO:7," that "no common element or attributes of the 
sequences are disclosed," and that "[t]here is no showing or evidence which links structural 
limitations or requirements to any particular functional limitations." Office Action at 7-8. 
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b. The rejected claims allegedly encompasses alternately spliced versions of 
the proteins, etc. for which no written description is provided 

The Examiner also makes the assertion that the claims encompass variants of the SEQ ID 
NO: 7 including "alternately spliced versions of the proteins, allelic variants including insertions 
and mutations, inactive precursor proteins which have a removable amino terminal end," and that 
"[n]o written description of alleles, of upstream or downstream regions containing additional 
sequence, or of alternative splice variants has been provided in the specification." Office Action 
at 8. 

c. The Examiner alleges that the instant case is similar to several court 
decisions 

The Examiner cites several cases for support of his argument that the rejected claims lack 
written description. First, the Examiner cites Regents of the University of California v. Eli Lilly, 
119 F.3d 1559, 43 U.S.P.Q.2d 1398 (Fed. Cir. 1997). The Examiner reproduces a portion of the 
opinion which states that "[a] definition by function, as we have previously indicated, does not 
suffice to define the genus because it is only an indication of what the gene does, rather than 
what it is. . .. Accordingly, naming a type of material generally known to exist, in the absence of 
knowledge as to what that material consists of, is not a description of that material." Office 
Action at 8. He concludes that "[i]n the current situation, the definition of the nucleic acids as 
having 95%-99% sequence identity to SEQ ID NO: 7 lacks any specific structure, since it lacks 
the correlation between structure and function that is at the heart of the caselaw and of the 
written description guidelines." Id. 

The Examiner next cites Fiers v. Sugano, 984 F.2d 1164, 25 U.S.P.Q.2d 1601 (Fed. Cir. 

1993) quoting the opinion as stating: 

"...if inventor is unable to envision detailed chemical structure of DNA sequence 
coding for specific protein, as well as method of obtaining it, then conception is 
not achieved until reduction to practice has occurred, that is, until after gene has 
been isolated.. .conception of any chemical substance, requires definition of that 
substance other than by its functional utility." Office Action at 9. 

The Examiner concludes that "the current situation is a definition of the compound without 
identifying the structure function relationship of the compound, so that the compound is claimed 
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solely its protein sequence related 80%-99% to SEQ ID NO: 7 without any correlative function 
to delimit the structure." 

Finally, the Examiner cites Vas-Cath, Inc. v. Mahurkar, 935 F.2d 1555, 19 U.S.P.Q.2d 
1111 (Fed. Cir. 1991), quoting the court as stating: "...applicant must also convey, with 
reasonable clarity to those skilled in art, that applicant, as of filing date sought, was in possession 
of invention, with invention being, for purposes of "written description" inquiry, whatever is 
presently claimed." Office Action at 9. The Examiner concludes that "[i]n the application at the 
time of filing, there is no record or description which would demonstrate conception of any 
proteins other than those expressly disclosed which comprise SEQ ID NO 7." Id. 

5. Appellants' Response - Rejected Claims 48-54 are Adequately Described 
As noted above, whether the applicants were in possession of the invention as of the 
effective filing date of an application is a factual determination, reached by the consideration of a 
number of factors, including the level of knowledge and skill in the art, and the teaching 
provided by the specification. The inventor is not required to describe every single detail of 
his/her invention. An applicant's disclosure obligation varies according to the art to which the 
invention pertains since the adequacy of a description is assessed from the view of one of skill in 
the art. The present invention pertains to the field of recombinant DNA/protein technology. It is 
well-established that the level of skill in this field is very high since a representative person of 
skill is generally a Ph.D. scientist with several years of experience. Accordingly, the teaching 
imparted in the specification must be evaluated through the eyes of a highly skilled artisan as of 
the date the application was filed. 

As amended, the pending claims are related to isolated polypeptides having at least 95%, 
96%, 97%, 98% or 99% amino acid sequence identity to SEQ ID NO: 7 or the amino acid 
sequence of the polypeptide encoded by the full-length coding sequence of the cDNA deposited 
under ATCC accession number 203661, and which satisfy the limitation "wherein said isolated 
polypeptide or a fragment thereof can be used to generate an antibody which can be used to 
specifically detect the polypeptide of SEQ ID NO:7 in lung or colon tissue samples." Support 
for these claims is found throughout the substitute specification. See, e.g., Specification at 11, 
lines 21-28; page 30, lines 16-21; page 32, line 31 through page 33, line 19; page 88, lines 4-5; 
page 95, lines 8-10; and page 108, lines 20-27. 
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a. The encompassed zenus is adequately described 

Appellants first address the Examiner's argument that the claims encompass a large 
genus for which no common elements or attributes are disclosed. Appellants maintain that there 
is no substantial variation within the species which fall within the scope of the rejected claims, 
which require at least 95% to 99% amino acid sequence identity to SEQ ID NO: 7 and can be 
used to generate antibodies which specifically detect the polypeptide of SEQ ID NO:7 in lung or 
colon tissue samples. As such, Appellants were in possession of the common attributes or 
features of the claimed subject matter. 

The rejected claims are analogous to the claims discussed in Example 14 of the written 
description training materials available on the PTO's website. In Example 14, the written 
description requirement was found to be satisfied for claims directed to polypeptides with 95% 
homology to a disclosed sequence that also possess a recited catalytic activity, where procedures 
for making variant proteins were routine in the art and the specification provided an assay for 
detecting the recited catalytic activity of the protein. This disclosure satisfies the written 
description requirement even though the applicant had disclosed only a single species and had 
not made any variants . The Guidelines state that "[t]he single species disclosed is representative 
of the genus because all members have at least 95% structural identity with the reference 
compound and because of the presence of an assay which applicant provided for identifying all 
of the at least 95% identical variants of SEQ ID NO: 3 which are capable of the specified 
catalytic activity." 

Similarly, the pending claims also have very high sequence homology to the disclosed 
sequence and must share an epitope sufficient to generate antibodies which specifically detect 
the polypeptide of SEQ ID NO: 7 in lung or colon tissue samples. As in Example 14, at the time 
of effective filing date of the instant application, it was well known in the art how to make 
polypeptides with at least 95% amino acid sequence identity to the disclosed sequences. See, 
e.g., Specification at 71, line 43-47. In addition, the specification discloses in detail how to make 
antibodies which specifically detect a particular PRO polypeptide, and how to use them to detect 
the PRO polypeptide in a particular tissue. See, e.g., Specification at 95, lines 8-21; and page 
124, lines 0-25. Like a particular catalytic activity, the function of being useful to produce an 
antibody specific to SEQ ID NO: 7 is directly related to the structure of the claimed polypeptide. 
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Thus, like Example 14, the genus of polypeptides that have at least 95% to 99% amino acid 
sequence identity to the disclosed sequences and possess the described functional activity are 
adequately described. 

Furthermore, while Appellants appreciate that actions taken by the PTO in other 
applications are not binding with respect to the examination of the present application, 
Appellants note that the PTO has issued many patents containing claims to variant nucleic acids 
or variant proteins where the applicants did not actually make such nucleic acids or proteins. 
Representative patents include U.S. Patent No: 6,737,522, U.S. Patent No. 6,395,306, U.S. 
Patent No. 6,025,1 56, U.S. Patent No. 6,645,499, U.S. Patent No. 6,498,235, and U.S. Patent No. 
6,730,502. 

b. The rejected claims do not encompass alternately spliced versions of the 
proteins, etc. for which no written description is provided 
Appellants respond next to the Examiner's argument that the claimed polypeptides 
encompass "alternately spliced versions of the proteins, allelic variants including insertions and 
mutations, inactive precursor proteins which have a removable amino terminal end" and that 
"[n]o written description of alleles, of upstream or downstream regions containing additional 
sequence, or of alternative splice variants has been provided in the specification." Office Action 
at 8. 

The Examiner may be referring to the definition of a "native sequence PRO polypeptide" 
found on page 35, line 29 through page 36, line 3 of the specification. That definition states that 
"a 'native sequence PRO polypeptide' encompasses naturally-occurring truncated or secreted 
forms of the specific PRO polypeptide (e.g., an extracellular domain sequence), naturally- 
occurring variant forms (e.g., alternatively spliced forms) and naturally-occurring allelic variants 
of the polypeptide." Specification at 35, lines 31-34. 

However, Appellants note that the rejected claims as amended are directed to isolated 
polypeptides having at least 95% to 99% amino acid sequence identity to SEP ID NO: 7 or the 
amino acid sequence of the polypeptide encoded by the full-length coding sequence of the cDNA 
deposited under ATCC accession number 203661 . The claims at issue are not directed to "native 
sequence PRO polypeptides," and never have been. Therefore, the Examiner's arguments 
regarding the scope of the rejected claims is wrong, and his arguments are irrelevant. 
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c. The instant case is distinct from the cited court decisions 
Finally, Appellants address the Examiner's arguments based on the cited easel aw. 
Appellants submit that the present case differs from the facts in Regents of the University of 
California v. Eli Lilly, 119 F.3d 1559, 43 U.S.P.Q.2d 1398 (Fed. Cir. 1997). In Lilly, the claim 
was directed to a nucleic acid which "encodes insulin." Id, at 1401. The Court characterized 
that claim as a "definition by function" since it merely describes the claimed nucleic acid by 
what it does - encodes insulin - rather than by its structure, i.e., its sequence. Id. at 1406. 
Because nucleic acids which encode insulin were known to exist, but the claims at issue recited 
no structure, the Court held that "naming a type of material generally known to exist, in the 
absence of knowledge as to what the material consist of is not a description of that material." Id. 
at 1406 (emphasis added). The fact that the claims lacked any specific structure was important, 
and the Court distinguished the claims at issue from those which recited a specific DNA 
sequence. Id. n. 4 ("We note that in claims 4, 5, and 12-14 of the 6 740 patent [the claimed DNA] 
are described by reference to the structure of the claimed DNA sequences rather than by 
reference to their function."). 

Here, the claimed genus is defined not solely by its function, but by its structure and a 
correlated function - the claimed polypeptides must be at least 95% identical to SEQ ID NO: 7 
(structure) and satisfy the functional limitation "wherein said isolated polypeptide or a fragment 
thereof can be used to generate an antibody which can be used to specifically detect the 
polypeptide of SEQ ID NO:7 in lung or colon tissue samples." Thus, the facts in this case are 
not at all similar to the facts in Lilly, and the rejected claims do not lack "the correlation between 
structure and function that is at the heart of the caselaw and of the written description guidelines" 
as the Examiner asserts. Office Action at 8. 

The instant case is also different from Fiers v. Sugano, 984 F.2d 1164, 25 U.S.P.Q.2d 
1601 (Fed. Cir. 1993). In that case, the Court held that in the absence of actual reduction to 
practice and disclosure of the entire DNA sequence, recitation of "its functional utility" 
(encoding a beta-interferon) was not sufficient to describe the DNA. Id. at 1604. Unlike Fiers, 
the polypeptide in the instant case has been reduced to practice - the gene of SEQ ID NO: 6 has 
been cloned and sequenced and the polypeptide of SEQ ID NO: 7 has been disclosed. Therefore, 
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the claimed polypeptides are not being described merely by their "functional utility" as in Fiers, 
and the claimed polypeptides are delimited by a correlative function. 

Finally, for the reasons articulated above, it is clear that the Appellants were in 
possession of the invention as claimed in the rejected claims at the time of filing. Clearly, 
throughout the specification the Appellants articulated the conception of polypeptides that were 
at least 95% identical to a particular PRO polypeptide, including PR0539, as well as antibodies 
which were specific to PR0539. See, e.g., Specification at 1 1, lines 21-28; page 30, lines 16-21; 
page 32 line 31 through page 33, line 19 (disclosing that the invention concerns polypeptides 
with at least 95% sequence identity to PR0539, and antibodies which specifically bind to them.) 
Thus, the Examiner's statement that "there is no record or description which would demonstrate 
conception of any proteins other than those expressly disclosed which comprise SEQ ID NO: 7" 
is simply not true. Office Action at 9. 

6. The Examiner's Rebuttal Arguments Do Not Address the Rejected Claims 
The Examiner has offered two arguments in response to Appellants 5 arguments that the 
rejected claims are adequately described: (1) SEQ ID NO: 7 is not representative of the claimed 
genus, and (2) that there is no expectation of insubstantial variation because the recited function 
is not correlated to the structure. Office Action at 19-23. All of the Examiner's rebuttal 
arguments are directed to canceled Claims 35, 37-40 and 46-47 which recited the limitation 
"wherein said isolated polypeptide is overexpressed in lung or colon tumors compared to normal 
lung or colon tissue, respectively, or wherein said isolated polypeptide is encoded by a nucleic 
acid which is amplified in lung or colon tumor compared to normal lung or colon tissue, 
respectively." Nowhere , does the Examiner address Appellants' arguments regarding the claims 
on appeal, Claims 48-54, which have an entirely different functional limitation - "wherein said 
isolated polypeptide or a fragment thereof can be used to generate an antibody which can be used 
to specifically detect the polypeptide of SEQ ID NO:7 in lung or colon tissue samples." 

Turning to first to the Examiner's argument that the single disclosed sequence, SEQ ID 
NO:7, is not representative of the claimed genus. The Examiner states that the Appellants' 
arguments articulated above with respect to the similarity of the instant claims to Example 14 is 
flawed, arguing that: 

Applicant's analysis is flawed since there is no expectation in the instant case of 
insubstantial variation because the functional limitation devolves solely to the 
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ability of the protein to act as a reductase, but no specific substrate is required by 
the claims. The function provides very little guidance or information regarding 
the structure and does not delimit the structure in even the smallest or most 
miniscule possible way. So the argument by Applicant that there would be 
insubstantial variation is not correct since the function of "overexpression in lung 
or colon tumor cells" does not limit the protein in any significant way. Office 
Action at 21 

It is not clear what "reductase" limitation the Examiner is referring to, as the only claim 
limitations are those of overexpression in canceled Claim 35, and the limitation regarding the use 
of the claimed polypeptides to generate antibodies found in the claims at issue, Claims 48-54. 
Thus, the Examiner's arguments are not responsive to Appellants' arguments. 

However, even setting the non-responsive nature of the Examiner's arguments aside, the 

basic premise that a large genus can not be adequately described by a single species is simply 

wrong. In a recent Federal Circuit decision, In re Wallach, 378 F.3d 1330, 1333-34 (Fed. Cir. 

2004), the Court stated: 

[W]e agree with Appellants that the state of the art has developed such that the 
complete amino acid sequence of a protein may put one in possession of the genus 
of DNA sequences encoding it , and that one of ordinary skill in the art at the time 
the '129 application was filed may have therefore been in possession of the entire 
genus of DNA sequences that can encode the disclosed partial protein sequence, 
even if individual species within that genus might not have been described or 
rendered obvious. ... A claim to the genus of DNA molecules complementary to 
the RNA having the sequences encompassed by that formula, even if defined only 
in terms of the protein sequence that the DNA molecules encode, while 
containing a large number of species, is definite in scope and provides the public 
notice required of patent applicants . 

Moreover, we see no reason to require a patent applicant to list every possible 
permutation of the nucleic acid sequences that can encode a particular protein for 
which the amino acid sequence is disclosed, given the fact that it is, as explained 
above, a routine matter to convert back and forth between an amino acid sequence 
and the sequences of the nucleic acid molecules that can encode it. Id. (emphasis 
added). 

The Court did not require the applicants in Wallach to actually make or individually 
describe all of the vast number of sequences which encode the disclosed sequence. This is in 
spite of the fact that only a single sequence was disclosed, and the encompassed genus was 
enormous due to codon degeneracy in the genetic code. The Court reasoned that because it is 
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routine to convert between amino acid sequences to nucleic acid sequences, disclosure of a 
single amino acid sequence was sufficient to place the applicants in possession of the enormous 
genus of nucleic acids which could encode the sequence. 

The facts in Wallach are very similar to the instant case. Here, Appellants have disclosed 
SEQ ID NO: 7, and claim polypeptides which are at least 95% identical to it and have the 
functional limitation of the ability to generate antibodies which can be used to specifically detect 
SEQ ID NO: 7 in lung or colon tissue samples. As discussed above, it is routine in the art to 
create polypeptides which have at least 95% to 99% sequence identity to SEQ ID NO: 7 - it is 
just as predictable and easy as creating all of the nucleic acids which encode a particular amino 
acid sequence. Similarly, it is well within the knowledge of those skilled in the art how to 
determine which polypeptides can be used to make the recited antibodies. The predictability of 
this structure/function combination is sufficient to place the claimed subject matter in the 
possession of the Appellants, and thus the claimed polypeptides are adequately described. The 
Wallach opinion makes clear that there is no need to literally describe more than a single species 
to adequately describe a large genus where one of skill in the art recognizes that the disclosed 
species puts the applicant in possession of the claimed genus. 

Appellants next address the Examiner's argument that there is an "absence of any 
structure-function relationship." Office Action at 22. As explained above, the Examiner has 
never addressed the functional limitation in Claim 48-54 which is at issue on appeal: "wherein 
said isolated polypeptide or a fragment thereof can be used to generate an antibody which can be 
used to specifically detect the polypeptide of SEQ ID NO:7 in lung or colon tissue samples." 

Clearly, as one of skill in the art will recognize, the ability of a polypeptide to generate an 
antibody which recognizes a particular target molecule is intimately related to the structure of the 
polypeptide. Therefore, the Examiner's unsupported argument that there is no structure-function 
relationship in the claims at issue is simply wrong. 

7. Each of Rejected Claims 48-52 Must Be Considered Individually 

The adequacy of written description support is a factual issue and is to be determined on 
a case-bv-case basis . See e.g., Vas-Cath, Inc. v. Mahurkar, 935 F.2d at 1563, 19 U.S.P.Q.2d at 
1116 (Fed. Cir. 1991) (emphasis added). The factual determination in a written description 
analysis depends on the nature of the invention and the amount of knowledge imparted to those 
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skilled in the art by the disclosure. Union Oil v. Atlantic Richfield Co., 208 F.3d 989, 996 (Fed. 
Cir. 2000). 

As discussed above, the Examiner has failed to specifically address Claims 48-54 which 
differ significantly from canceled Claim 35, the claim which the Examiner has addressed. 
However, even beyond addressing Claims 48-54 as a group, the Examiner must address each of 
Claims 48-52 individually, on a case-by-case basis. This is because the encompassed genus of 
each claim is different, and therefore whether SEQ ID NO:7 is representative of the genus 
depends on the claim at issue. Appellants hereby request that the Board consider the each of 
Claims 48-52 individually with respect to the written description requirement. 

a. Rejected Claim 48 is adequately described 

Because SEQ ID NO:7 is 830 amino acids long, a polypeptide which is at least 95% 
identical can only have approximately 42 deletions or substitutions. For the reasons discussed 
above, Appellants believe that independent Claim 48 is adequately described. 

b. Rejected Claim 49 is adequately described 

For the reasons discussed above, Appellants believe that dependent Claim 49 is 
adequately described. However, because the genus of polypeptides encompassed by Claim 49 is 
smaller than that of Claim 48, (a polypeptide which is at least 96% identical can only have 
approximately 33 deletions or substitutions) the adequacy of the disclosure of the species SEQ 
ID NO:7 is more apparent to those of skill in the art. 

c. Rejected Claim 50 is adequately described 

For the reasons discussed above, Appellants believe that dependent Claim 50 is 
adequately described. However, because the genus of polypeptides encompassed by Claim 50 is 
smaller than that of Claims 48, or 49, (a polypeptide which is at least 97% identical can only 
have approximately 25 deletions or substitutions) the adequacy of the disclosure of the species 
SEQ ID NO:7 is more apparent to those of skill in the art. 
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d. Rejected Claim 51 is adequately described 

For the reasons discussed above, Appellants believe that dependent Claim 51 is 
adequately described. However, because the genus of polypeptides encompassed by Claim 51 is 
smaller than that of Claims 48-50, (a polypeptide which is at least 98% identical can only have 
approximately 17 deletions or substitutions) the adequacy of the disclosure of the species SEQ 
ID NO:7 is more apparent to those of skill in the art. 

e. Rejected Claim 52 is adequately described 

For the reasons discussed above, Appellants believe that dependent Claim 52 is 
adequately described. However, because the genus of polypeptides encompassed by Claim 52 is 
smaller than that of Claims 48-51, (a polypeptide which is at least 99% identical can only have 
approximately 8 deletions or substitutions) the adequacy of the disclosure of the species SEQ ID 
NO:7 is more apparent to those of skill in the art. 

/ Rejected Claims 53-54 are adequately described 
For the reasons discussed above, Appellants believe that dependent Claims 53-54 are 
adequately described. However, because the genus of polypeptides encompassed by Claims 53- 
54 is smaller than that of independent Claim 48, the adequacy of the disclosure of the species 
SEQ ID NO:7 is more apparent to those of skill in the art. 

8. Written Description — Conclusion 

In conclusion, the Board should reverse the Examiner's written description rejection of 
Claims 48-54, as a group and individually because the Examiner has failed to rebut the 
presumption that the claims are adequately described, as he has failed to even address the claims 
at issue. And even if the Examiner's arguments are extended to the claims at issue, Appellants 
submit that they have satisfied the written description requirement for rejected Claims 48-54 
based on the actual reduction to practice of SEQ ID NO: 7, by specifying a high level of amino 
acid sequence identity, and by describing how to make and use antibodies to the disclosed 
sequence. These facts are directly analogous to those of Example 14 of the Written Description 
Guidelines published by the PTO. In addition, like In re Wallach, the description of the single 
species SEQ ID NO: 7 is sufficient to place the Appellants' in possession of the claimed genus 
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because those of skill in the art recognize the correlation between polypeptide structure and the 
ability to generate specific antibodies. Appellants submit that the instant disclosure allows one 
of skill in the art to "recognize that the applicant was in possession of the necessary common 
attributes or features of the elements possessed by the members of the genus." Hence, 
Appellants respectfully request that the Board reverse the Examiner's written description 
rejection of Claims 48-54 under 35 U.S.C. §112, first paragraph. 

D. Enablement Rejection - Detailed Arguments 

The third issue before the Board is whether Appellants have enabled the pending claims 
such that one of skill in the art would be able to make and use the claimed invention. The 
Examiner has rejected Claims 27-28, 32-34 and 48-54 under 35 U.S.C. §112, first paragraph, 
arguing that the claimed subject matter was not described in the specification in such a way as to 
enable one skilled in the art to make and/or use the invention. Office Action at 10. The 
Examiner recites the factors for determining enablement from In re Wands, 858 F.2d 731, 8 
U.S.P.Q. 2d 1400 (Fed. Cir. 1988), relying to a large extent on the arguments made above in 
support of the rejection for lack of utility. 

Appellants submit that Claims 27-28, 32-34 and 48-54 are enabled such that one of skill 
in the art could make and use the claimed polypeptides without undue experimentation. With 
respect to Claims 27-28 and 32-34, how to make the polypeptide of SEQ ID NO: 7 and the 
polypeptide encoded by the cDNA deposited under ATCC accession number 203661 is within 
the skill in the art. Similarly, with respect to Claims 48-54, it is well within the skill of those in 
the art to make polypeptides that are at least 95% identical to SEQ ID NO: 7 and the polypeptide 
encoded by ATCC 203661, and it is well within those of skill in the art to make antibodies which 
are specific to a disclosed sequence. See also In re Wands, 858 F.2d 731 (reversing the Board's 
decision of non-enablement and holding that as of 1980, undue experimentation was not required 
to make high-affinity monoclonal antibodies to a target peptide). Thus, one of skill in the art 
would be able to make the claimed polypeptides without undue experimentation, and the 
Examiner has not made any arguments to the contrary. 

As described above, Appellants assert that the claimed polypeptides are useful as 
diagnostic tools for cancer, particularly lung and colon cancer. This use is based on the 
disclosure in Example 16 of the instant application that the PR0539 gene is amplified at least 
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two-fold in the majority of the lung and colon tumors tested. As detailed above, it is well- 
established that gene amplification is correlated with an overexpression of the corresponding 
mRNA and the encoded polypeptide, and thus it is likely that the PR0539 polypeptide is 
overexpressed in lung and colon cancer. Thus, based on the disclosure in the application, one of 
skill in the art would be able to use the claimed polypeptides as diagnostic tools to distinguish 
suspected lung and colon tumors from normal tissue without undue experimentation. 

1. Enablement — Legal Standard 

An application enables the claims "if one skilled in the art, after reading the[] disclosure[], 
could practice the invention claimed ... without undue experimentation." Chiron Corp. v. 
Genentech, Inc., 363 F.3d 1247, 1253 (Fed. Cir. 2004). "But the question of undue 
experimentation is a matter of degree. The fact that some experimentation is necessary does not 
preclude enablement; what is required is that the amount of experimentation 'must not be unduly 
extensive."' PPG Indus., Inc. v. Guardian Indus., Corp., 75 F.3d 1558, 1564 (Fed. Cir. 1996) 
(quoting Atlas Powder Co. v. E.I DuPont de Nemours & Co., 750 F.2d 1569, 1576 (Fed. Cir. 
1984)). 

While the application must enable one of ordinary skill in the art to practice the full scope 
of the claimed invention, "[t]hat is not to say that the specification itself must necessarily 
describe how to make and use every possible variant of the claimed invention, for the artisan's 
knowledge of the prior art and routine experimentation can often fill gaps, interpolate between 
embodiments, and perhaps even extrapolate beyond the disclosed embodiments, depending upon 
the predictability of the art." AK Steel Corp. v. Sollac, 344 F.3d 1234, 1244 (Fed. Cir. 2003). 

2. Enablement - Burden of Proof 

In order to make an enablement rejection, the PTO has the initial burden to establish a 
reasonable basis to question the enablement provided for the claimed invention. See M.P.E.P. § 
2164.04. A specification teaching how to make and use the claimed subject matter must be taken 
as being in compliance with the enablement requirement unless there is a reason to doubt the 
objective truth of the statements contained therein which are relied on for enabling support. Id. 
It is incumbent for the PTO "to explain why it doubts the truth or accuracy of any statement in a 
supporting disclosure and to back up assertions of its own with acceptable evidence or reasoning 
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which is inconsistent with the contested statement." Id. (quoting In re Marzocchi, 439 F.2d 220, 
224, 169 U.S.P.Q. 367, 370 (C.C.P.A. 1971). This can be done "by making specific findings of 
fact, supported by the evidence, and then drawing conclusions based on these findings of fact." 
Id. 

3. Enablement - Standard of Proof 

Once the examiner has weighed all the evidence and established a reasonable basis to 
question the enablement provided for the claimed invention, the burden falls on the applicant to 
present persuasive arguments, supported by suitable proofs where necessary, that one skilled in 
the art would be able to make and use the claimed invention using the application as a guide. See 
M.P.E.P. § 2164.05. "The evidence provided by applicant need not be conclusive but merely 
convincing to one skilled in the art." Id. (bold emphasis added, underline in original). "A 
declaration or affidavit is, itself evidence that must be considered ." Id. (emphasis in original). 

The examiner must then "weigh all the evidence before him or her, including the 
specification and any new evidence supplied by applicant with evidence and/or sound scientific 
reasoning previously presented in the rejection and decide whether the claimed invention is 
enabled." Id. "The examiner should never make the determination based on personal opinion. 
The determination should always be based on the weight of all the evidence." Id. (emphasis in 
original). 

4. Appellants 9 Specification Teaches How to Make and Use the Claimed Subject 
Matter 

The claimed subject matter relates to polypeptides of SEQ ID NO: 7 and the polypeptide 
encoded by ATCC deposit 203661, and polypeptides which are at least 95% identical to those 
polypeptides and which can be used to make antibodies that specifically detect PR0539 in lung 
or colon tissue. The specification discloses how to make the claimed polypeptides, for example 
in Examples 23-26 of the specification, and at page 73, line 36 through page 79, line 20. In 
addition, methods for making polypeptides that are at least 95% identical to SEQ ID NO: 7 by 
making substitutions or deletions are also disclosed in the specification and well known in the art. 
See e.g., Specification at page 69, line 37 through page 72, line 9. Methods for making and 
testing antibodies for specificity are well known in the art, and are disclosed in the specification, 
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including Example 27 on page 124 of the specification which specifically describes the 
preparation of antibodies that bind PRO polypeptides. See, e.g. Specification at page 87, line 35 
through page 95, line 6; and page 124, lines 0-25. In addition, the specification discloses that 
antibodies to claimed polypeptides can be used in diagnostic assays to detect the expression of 
PR0539 in specific types of tissue. Specification at 95, lines 8-21. In light of the amplification 
of the PR0539 gene in a majority of lung and colon tumors tested, one of skill in the art would 
expect the PR0539 polypeptide to overexpressed in these tumors as well. Therefore, given the 
teaching in the specification on how to make and use the claimed polypeptides to detect 
expression of PR0539 in specific tissues, one of skill in the art would be enabled to practice the 
claimed invention without undue experimentation. 

Because Appellants' specification teaches how to make and use the claimed subject 
matter, it must be taken as being in compliance with the enablement requirement unless there is a 
reason to doubt the objective truth of the statements contained therein which are relied on for 
enabling support. SeeM.P.E.P. § 2164.04. Id. 

5. The Examiner's Arguments and Appellants 9 Responses 

The PTO has the initial burden to establish a reasonable basis to question the enablement 
provided for the claimed invention. See M.P.E.P. § 2164.04. It is incumbent for the PTO "to 
explain why it doubts the truth or accuracy of any statement in a supporting disclosure and to 
back up assertions of its own with acceptable evidence or reasoning which is inconsistent with 
the contested statement." Id. (quoting In re Marzocchi, 439 F.2d 220, 224, 169 U.S.P.Q. 367, 
370 (C.C.P.A. 1971). This can be done "by making specific findings of fact, supported by the 
evidence , and then drawing conclusions based on these findings of fact." Id. 

The Examiner has relied on the factors to be considered in determining whether a 
disclosure meets the enablement requirement as described in In re Wands, 858 F.2d 731, 8 
U.S.P.Q. 2d 1400 (Fed. Cir. 1988). The Examiner's arguments are all directed at the "use" 
prong of the enablement question. Essentially, the Examiner is arguing that the because there is 
no "necessary" correlation between gene amplification and mRNA or protein overexpression, 
one of skill in the art would not know what to use the claimed subject matter for, rather than how 
to use the claimed subject matter. The Examiner's arguments are essentially an enablement 
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rejection based on his asserted lack of utility. For the same reasons that the utility rejection 
should be reversed, the enablement rejection should be reversed as well. 

Appellants state the Examiner's arguments regarding each Wand factor and respond to 
each in turn below. 

a. The nature of the invention 

The Examiner argues that the invention is "in a class of invention which the CAFC has 
characterized as 'the unpredictable arts such as chemistry and biology.' Mycogen Plant Sci., Inc. 
v. Monsanto Co., 243 F.3d 1316,1330 (Fed. Cir. 2001)." Office Action at 10. 

While Appellants acknowledge that some aspects of biology can be unpredictable, the 
expression of polypeptides and the creation of specific antibodies to a disclosed antigen is not 
unpredictable. This issue was addressed in In re Wands, 858 F.2d 731, 8 U.S.P.Q. 2d 1400 (Fed. 
Cir. 1988), where the CAFC held that the disclosure was sufficient to enable one of skill in the 
art to make monoclonal antibodies to a disclosed antigen without undue experimentation. Id. at 
740. If the disclosure was sufficient at the time of filing of the Wands application in 1980, it 
cannot be that the art of expressing polypeptides and making antibodies has become less 
predictable in the ensuing 25 years. Similarly, the use of antibodies to detect a target peptide 
such as PR0539 in a particular tissue is similarly well-established in the art. See Specification at 
95, lines 8-21, and the cited references which are incorporated by reference into the specification. 

Thus, contrary to the Examiner's assertions, the nature of the invention weighs in favor of 
enablement as it is a predictable and well-established aspect of biology. 

b. The breadth of the claims 

The Examiner argues that "[t]he claims broadly encompass not only a particular 
PRO-539 protein but also include any protein which shares 95% sequence identity to SEQ ID 
NO: ir Office Action at 11. 

The broadest pending claim is Claim 48, which recites: 

48. An isolated polypeptide having at least 95% amino acid sequence identity to: 

(a) the amino acid sequence of the polypeptide of SEQ ID NO:7; or 

(b) the amino acid sequence of the polypeptide encoded by the full-length 
coding sequence of the cDNA deposited under ATCC accession number 203661; 
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wherein said isolated polypeptide or a fragment thereof can be used to generate an 
antibody which can be used to specifically detect the polypeptide of SEP ID NO: 7 in 
lung or colon tissue samples . 

Thus, only those polypeptides which share an epitope with SEQ ID NO: 7, such that an 
antibody to that epitope can specifically detect PR0539 in lung or colon tissue samples is 
encompassed by the broadest claims. This is not as broad a genus as "any protein which shares 
95% sequence identity to SEQ ID NO: 7" as the Examiner alleges, but is instead limited to a 
well-defined genus which binds a specific disclosed polypeptide. 

In addition, dependent Claims 49-52 further limit the percent identity to at least 96-99%. 
A polypeptide that is at least 99% identical to SEQ ID NO: 7 can only have approximately 8 
substitutions or deletions. Finally, for Claims 27-28 and 32-34, the genus is limited to SEQ ID 
NO: 7 or the polypeptide encoded by ATCC deposit 203661 . The Examiner has failed to address 
each of these more narrow claims independently as each one is more narrow in scope than the 
Examiner has alleged. 

Thus, contrary to the Examiner's assertions, the breath of the claims weighs in favor of 
enablement, particularly for the narrower dependent Claims 49-52 and Claims 27-28 and 32-34. 

c. The quantity of experimentation 

The Examiner argues that the quantity of experimentation would be extremely large 
"since there is significant variability in the activity of polypeptides and nucleic acids. It would 
require significant study to identify the actual function of the PRO-539 protein and nucleic acid , 
and identifying a use for this protein would be an inventive, unpredictable and difficult 
undertaking in itself." Office Action at 1 1 (emphasis added). 

As Appellants discussed above in response to similar arguments by the Examiner, the 
asserted use of the claimed polypeptides does not rely on knowing the function of the PR0539 
protein or nucleic acid. Instead, the claimed polypeptides can be used as diagnostic tools for 
cancer by detecting the overexpression of the PR0539 polypeptide in lung and colon tumors. 
This use in no way depends on the function of the PR0539 protein, and therefore experiments to 
determine the function of PR0539 are unnecessary. 

In addition, as discussed above with respect to the nature of the invention, making the 
claimed polypeptides and antibodies which specifically bind to PR0539 to detect its expression 
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does not require a great deal of experimentation, as the techniques are well-established in the art. 
See In re Wands, 858 F.2d 731 (reversing the Board's decision of non-enablement and holding 
that as of 1980, undue experimentation was not required to make high-affinity monoclonal 
antibodies to a target peptide). 

Thus, contrary to the Examiner's assertions, the quantity of experimentation is minimal, 
and this factor weighs in favor of enablement. 

d. The unpredictability of the art and the state of the prior art 

Echoing the arguments regarding quantity of experimentation, the Examiner argues that 
"[t]he art is extremely unpredictable with regard to protein function in the absence of reliable 
information regarding the protein activity." Office Action at 1 1. The Examiner again cites Rost 
et al. 9 arguing that proteins with different sequences may have very different functions. He 
argues that because the function of the PR0539 protein is not known, "it is entirely unpredictable 
what function and activity will be found for this protein," and there is no prior art to resolve this 
ambiguity. Id. 

The Examiner then repeats his arguments in support of the utility rejection, stating that 
"the overexpression of the nucleic acid is not relevant to the utility of the protein" and that 
"[t]here is no evidence that the protein itself is overexpressed." Office Action at 11. The 
Examiner repeats verbatim the arguments made above regarding a lack of a "necessary" 
connection between gene amplification and mRNA and protein levels, citing the Meric et al, 
Gokman-Polar et al and Pennica et al references. Office Action at 11-13. The Examiner 
concludes by stating that "it is entirely unpredictable how one would use this protein in any 
context whatsoever." Office Action at 13. 

Appellants have respond at length to these arguments above, and refer the Board to the 
discussion of the utility rejection supra. To briefly summarize those arguments, Appellants have 
demonstrated that the references relied on by the Examiner do not support his position, and 
instead support the Appellants' arguments. Appellants have provided substantial evidence in the 
form of additional data, expert declarations, and references which show that it is well-established 
and predictable that the more than two-fold amplification of the PR0539 gene in lung and colon 
tumors will result in overexpression of the PR0539 protein in those tumors. Therefore given the 
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disclosure in the specification, one of skill in the art would be able to use the claimed 
polypeptides as diagnostic tools for lung and colon cancer. 

Thus, contrary to the Examiner's assertions, the prior art regarding the correlation 
between gene amplification and protein overexpression is sufficiently predictable that this factor 
weighs in favor of enablement. 

e. Working examples 

The Examiner asserts that the specification has no working examples that relate to the 
protein, and repeats his arguments that the data showing overexpression of the nucleic acid in 
certain cancer cell lines are not relevant. He argues that "there is no statistical showing that the 
overexpression of the nucleic acids is even significant" and "there is no showing that the results 
from nucleic acids have any correlation with the protein and the art cited above demonstrates that 
there is no presumption of such a correlation." Office Action at 13. 

First, Appellants have submitted the declaration of Dr. Goddard which states that the data 
in Example 16 are sufficiently reliable and significant such that the PR0539 gene can be used as 
a cancer diagnostic tool. See Goddard Declaration at ^[7. Appellants remind the Board that 
"The evidence provided by applicant need not be conclusive but merely convincing to one 
skilled in the art" and that " [a] declaration or affidavit is. itself evidence that must be 
considered ." M.P.E.P. § 2164.05 (emphasis in original). The Examiner has not presented any 
evidence or argument in response to the Goddard Declaration, and therefore Appellants have 
established that the data regarding amplification of the PR0539 gene is significant. As to the 
Examiner's argument that there is no correlation between the gene amplification data and the 
protein and antibody based on the three cited references, this argument has been addressed above. 

Appellants also note that contrary to the Examiner's assertion, the specification describes 
in detail how to express PRO polypeptides and make antibodies which can specifically bind PRO 
polypeptides. See, e.g., Specification at 118, line 23 through 124, line 25. In addition, the 
specification teaches how to use antibodies to a PRO polypeptide to detect its expression in 
specific cell or tissue types. Specification at 95, lines 8-21 . 

Thus, contrary to the Examiner's assertions, there are sufficient working examples and 
teachings related to the claimed polypeptides that this factor weighs in favor of enablement. 
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/ Guidance in the specification 

The Examiner summarily states that u [t]he specification provides no specific or 
substantial uses for the PRO-539 protein." Office Action at 13. 

Contrary to this assertion, Example 16 which shows a more than two-fold amplification 

of PR0539 in lung and colon tumors teaches that: 

Amplification is associated with overexpression of the gene product, indicating 
that the polypeptides are useful targets for therapeutic intervention in certain 
cancers such as colon, lung, breast and other cancers and diagnostic determination 
of the presence of those cancers . Specification at 108, lines 22-25 (emphasis 
added). 

In addition, the specification teaches that: 

[A]nti-PRO antibodies may be used in diagnostic assays for PRO, e.g.. detecting 
its expression in specific cells, tissues, or serum . Various diagnostic assay 
techniques known in the art may be used, such as competitive binding assays, 
direct or indirect sandwich assays and immunoprecipitation assays conducted in 
either heterogeneous or homogeneous phases [Zola, Monoclonal Antibodies: A 
Manual of Techniques . CRC Press, Inc. (1987) pp. 147-158]. Specification at 95, 
lines 9-13. 

This claim is supported by substantial evidence discussed above with respect to utility. 
Together, with the level of knowledge in the art and the disclosure of how to make antibodies 
which are specific to PR0539, the specification clearly teaches that the claimed polypeptides 
related to PR0539 are useful as diagnostic tools for lung and colon cancer, and how such use can 
be accomplished. In addition, there is extensive disclosure throughout the specification on how 
to make and use the claimed polypeptides and related antibodies. See, e.g., Specification at page 
69, line 37 through page 72, line 9; page 73, line 36 through page 79, line 20; page 87, line 35 
through page 95, line 6; page 95, lines 8-21; and page 118, line 23 through page 124, line 25. 

Thus, contrary to the Examiner's assertions, the specification provides sufficient 
guidance on how to use the claimed polypeptides such that this factor weighs in favor of 
enablement. 



g. Level of skill in the art 
Appellants agree with the Examiner that the level of skill in the art is high, with the 
typical skilled artisan having a doctorate degree and several years of post-graduate work. Thus, 
this factor weighs heavily in favor of enablement. 
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6. Grouping of Rejected Claim 

For purposes of the enablement rejection, Claims 27-28, and 32-34 can be considered as a 
group, Claims 48 and 53-54 can be considered as a group, and each of Claims 49-52 should be 
considered individually. 

a. Claims 27-28 and 32-34 are enabled 

Claims 27-28 and 32-34 are enabled for the reasons discussed above. The scope of these 
claims is narrow, and because SEQ ID NO: 7 and ATCC deposit 203661 are explicitly disclosed 
in the specification, no experimentation of any kind is required to make the claimed polypeptides. 
One of skill in the art would clearly be able to use these polypeptides to make antibodies which 
are specific to either of these polypeptides, such that the level of expression of these polypeptides 
could be assessed in lung or colon tissue. The only question is whether the use of these 
polypeptides to make antibodies to detect their expression level is a substantial and specific 
utility. For the reasons discussed at length above, Appellants believe that the amplification of 
the PR0539 gene in lung and colon tumors provides the claimed polypeptides with a substantial 
and specific utility. Therefore, Claims 27-28 and 32-34 are enabled. 

b. Claims 48 and 53-54 are enabled 

Claims 48 and 53-54 are enabled for the reasons discussed above. The scope of these 
claims is broader than that of Claims 27-28 and 32-34. Because SEQ ID NO:7 is 830 amino 
acids long, a polypeptide which is at least 95% identical can only have approximately 42 
deletions or substitutions. While some experimentation will be required to make these 
polypeptides that are not identical to SEQ ID NO:7, any such experimentation is routine in the 
art and will not be undue. One of skill in the art would clearly be able to use these polypeptides 
to make antibodies which are specific to SEQ ID NO:7, such that the expression level of 
PR0539 can be assessed in lung or colon tissue. The only question is whether the use of these 
polypeptides to make antibodies to detect the expression level of PR0539 is a substantial and 
specific utility. For the reasons discussed at length above, Appellants believe that it is, and 
therefore, Claims 48 and 53-54 are enabled. 
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c. Claim 49 is enabled 

Claim 49 is enabled for the reasons discussed above. The scope of this claim is narrower 
than that of Claim 48 since a polypeptide which is at least 96% identical can only have 
approximately 33 deletions or substitutions. As a result, less experimentation will be required to 
make these polypeptides, although any experimentation remains routine. One of skill in the art 
would clearly be able to use these polypeptides to make antibodies which are specific to SEQ ID 
NO:7, and for the reasons discussed above, Appellants believe that such use is substantial and 
specific. Therefore, Claim 49 is enabled. 

d. Claim 50 is enabled 

Claim 50 is enabled for the reasons discussed above. The scope of this claim is narrower 
than that of Claims 48-49 since a polypeptide which is at least 97% identical can only have 
approximately 25 deletions or substitutions. As a result, less experimentation will be required to 
make these polypeptides, although any experimentation remains routine. One of skill in the art 
would clearly be able to use these polypeptides to make antibodies which are specific to SEQ ID 
NO:7, and for the reasons discussed above, Appellants believe that such use is substantial and 
specific. Therefore, Claim 50 is enabled. 

e. Claim 51 is enabled 

Claim 51 is enabled for the reasons discussed above. The scope of this claim is narrower 
than that of Claims 48-50 since a polypeptide which is at least 98% identical can only have 
approximately 17 deletions or substitutions. As a result, less experimentation will be required to 
make these polypeptides, although any experimentation remains routine. One of skill in the art 
would clearly be able to use these polypeptides to make antibodies which are specific to SEQ ID 
NO:7, and for the reasons discussed above, Appellants believe that such use is substantial and 
specific. Therefore, Claim 5 1 is enabled. 

/ Claim 52 is enabled 
Claim 52 is enabled for the reasons discussed above. The scope of this claim is narrower 
than that of Claims 48-51 since a polypeptide which is at least 99% identical can only have 
approximately 8 deletions or substitutions. As a result, less experimentation will be required to 
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make these polypeptides, although any experimentation remains routine. One of skill in the art 
would clearly be able to use these polypeptides to make antibodies which are specific to SEQ ID 
NO:7, and for the reasons discussed above, Appellants believe that such use is substantial and 
specific. Therefore, Claim 52 is enabled. 

7. Enablement - Conclusion 

For the reasons discussed above, all of the In re Wand factors weigh in favor of a finding 
of enablement. The Examiner has made no attempt to argue that one of skill in the art would be 
unable to make the claimed polypeptides. The remainder of his arguments are essentially those 
made in support of the utility rejection which fail for the reasons articulated above. Therefore, 
the Examiner has failed to meet his initial burden to establish a reasonable basis to question the 
enablement provided for the claimed invention. See M.P.E.P. § 2164.04. The Examiner has 
failed to back up his assertions "with acceptable evidence or reasoning which is inconsistent with 
the contested statement [of enablement]." Id. (quoting In re Marzocchi, 439 F.2d 220, 224, 169 
U.S.P.Q. 367, 370 (C.C.P.A. 1971). 

And even if the Examiner has met his burden, Appellants have presented persuasive 
arguments, supported by the evidence discussed above with respect to utility, that one skilled in 
the art would be able to make and use the claimed invention using the application as a guide. 
Appellants remind the Board that "[t]he evidence provided by applicant need not be conclusive 
but merely convincing to one skilled in the art." M.P.E.P. § 2164.05 (emphasis in original). 

Appellants submit that rejected Claims 27-28, and 32-34 should be considered as a group, 
Claims 48 and 53-54 should be considered as a group, and each of Claims 49-52 should be 
considered individually. This is because the scope of the subject matter in each of the groups 
differs, and therefore varying amounts of experimentation will be required to make and use each 
of the groups. However, Appellants submit that even for the group with the broadest coverage, 
Claims 48 and 53-54, any experimentation would be routine for those of skill in the art given the 
high level of sequence identity required for the claims. 

Considering all of the evidence provided by the Appellants to establish their asserted 
utility, along with the disclosure in the specification, the Board should find that Appellants have 
established that one of skill in the art would be able to make and use the claimed invention 
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without undue experimentation, and the Examiner's rejection of the pending claims as lacking an 
enabling disclosure should be reversed. 

E. Conclusion 

In view of the arguments presented above, Appellants submit that the specification as 
filed provides a specific, substantial and credible utility for the claimed polypeptides, that the 
specification as filed provides an adequate description of the claimed subject matter, and that one 
of skill in the art would be able to make and use the claimed polypeptides without undue 
experimentation. Appellants therefore request that the Board reverse the Examiners rejections 
under 35 U.S.C. §§101 and 112. 

Please charge any additional fees, including any fees for additional extension of time, or 
credit overpayment to Deposit Account No. 11-1410. 



Respectfully submitted, 



KNOBBE, MARTENS, OLSON & BEAR, LLP 




Attorney of Record 
Customer No. 30,3 1 3 
(619)235-8550 



1924473 
091205 
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VIII. APPENDIX A - CLAIMS ON APPEAL 

27. An isolated polypeptide comprising: 

(a) the amino acid sequence of the polypeptide of SEQ ID NO:7; or 

(b) the amino acid sequence of the polypeptide encoded by the full-length 
coding sequence of the cDNA deposited under ATCC accession number 203661 . 

28. The isolated polypeptide of Claim 27 comprising the amino acid sequence of the 
polypeptide of SEQ ID NO:7. 

32. The isolated polypeptide of Claim 27 comprising the amino acid sequence of the 
polypeptide encoded by the full-length coding sequence of the cDNA deposited under ATCC 
accession number 203661 . 

33. A chimeric polypeptide comprising a polypeptide according to Claim 27 fused to 
a heterologous polypeptide. 

34. The chimeric polypeptide of Claim 33, wherein said heterologous polypeptide is 
a tag polypeptide or an Fc region of an immunoglobulin. 

48. An isolated polypeptide having at least 95% amino acid sequence identity to: 

(a) the amino acid sequence of the polypeptide of SEQ ID NO:7; or 

(b) the amino acid sequence of the polypeptide encoded by the full-length 
coding sequence of the cDNA deposited under ATCC accession number 203661 ; 

wherein said isolated polypeptide or a fragment thereof can be used to generate an 
antibody which can be used to specifically detect the polypeptide of SEQ ID NO:7 in 
lung or colon tissue samples. 

49. The isolated polypeptide of Claim 48, wherein said amino acid sequence identity 
is at least 96%. 

50. The isolated polypeptide of Claim 48, wherein said amino acid sequence identity 
is at least 97%. 

51. The isolated polypeptide of Claim 48, wherein said amino acid sequence identity 
is at least 98%. 

52. The isolated polypeptide of Claim 48, wherein said amino acid sequence identity 
is at least 99%. 

53. A chimeric polypeptide comprising a polypeptide according to Claim 48 fused to 
a heterologous polypeptide. 
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54. The chimeric polypeptide of Claim 53, wherein said heterologous polypeptide is a 
tag polypeptide or an Fc region of an immunoglobulin. 
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IX. APPENDIX B - EVIDENCE 

Attached hereto is a copy of the evidence cited in Appellants' Brief. The list of evidence 
below is accompanied by a statement setting forth where in the record that evidence was entered 
into the record by the Examiner. 



Tab Reference 



Submitted 



Entered 



1 Declaration of Audrey 
Goddard, Ph.D. 



Benjamin Lewin, 
Genes V, 5th ed. 1994, 
pages 1196-1201 

Alitalo (Med. Biol., 
(1984) 62:304-317) 



4 Merlino et al. (J. Clin. 
Invest., (1985) 
75:1077-1079) 

5 Orntoft et al. 
(Molecular and 
Cellular Proteomics, 
(2002) 1:37-45), 

6 Hyman et al. (Cancer 
Research, (2002) 
62:6240-6245) 



Pollack et al. (PNAS, 
(2002) 99:12963- 
12968) 



Bahnassy et al. (BMC 
Gastroenterology, 
(2004) 4:22-34), 



Originally submitted with 
Amendment and Response to 
Office Action mailed June 
16, 2004 at page 16 
Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 1 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 2 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 3 

Originally submitted with 
Amendment and Response to 
Final Office Action mailed 
December 6, 2004 as Exhibit 
1 

Originally submitted with 
Amendment and Response to 
Final Office Action mailed 
December 6, 2004 as Exhibit 
2 

Originally submitted with 
Amendment and Response to 
Final Office Action mailed 
December 6, 2004 as Exhibit 
3 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 7 



Entered by Examiner in Final 
Office Action mailed 
September 7, 2004 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in 
Advisory Action mailed 
December 20, 2004 



Entered by Examiner in 
Advisory Action mailed 
December 20, 2004 



Entered by Examiner in 
Advisory Action mailed 
December 20, 2004 



Entered by Examiner in Final 
Office Action mailed April 25, 
2005 
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9 Blancato et al. (British 
Journal of Cancer, 
(2004) 90(8), 1612- 
1619) 

1 0 Declaration of Victoria 
Smith, Ph.D. 



1 1 Declaration by J. 

Christopher Grimaldi, 



1 2 Declaration of Paul 
Polakis, Ph.D. 



1 3 Bruce Alberts, et al. , 
Molecular Biology of 
the Cell (3 rd ed. 1994) 

1 4 Bruce Alberts, et al. , 
Molecular Biology of 
the Cell (4 th ed. 2002) 



15 Benj amin Lewin, 
Genes VI (1997), pages 
847-848 

16 Zhigang et al., World 
Journal of Surgical 
Oncology 2: 13 (2004) 

1 7 Meric et al., Molecular 
Cancer Therapeutics, 
vol. 1,971-979 (2002) 

18 U.S. Patent No. 
6,737,522 



19 U.S. Patent No. 
6,395,306 



Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 8 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 13 

Originally submitted with 
Amendment and Response to 
Office Action mailed June 
16, 2004 at page 17 
Originally submitted with 
Amendment and Response to 
Office Action mailed June 
16, 2004 at page 18 
Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 9 

Originally submitted with 
Amendment and Response to 
Final Office Action mailed 
December 6, 2004 as Exhibit 
4 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 11 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 12 



Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 14 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 15 



Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed 
September 7, 2004 

Entered by Examiner in Final 
Office Action mailed 
September 7, 2004 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in 
Advisory Action mailed 
December 20, 2004 



Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Cited by Examiner in Office 
Action mailed March 19, 2004 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 
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20 U.S. Patent No. 
6,025,156 



21 U.S. Patent No. 
6,645,499 



22 U.S. Patent No. 
6,498,235 



23 U.S. Patent No. 
6,730,502 



Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 16 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 17 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 18 

Originally submitted with 
Submission Filed With RCE 
mailed March 4, 2005 as 
Exhibit 19 



Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 

Entered by Examiner in Final 
Office Action mailed April 25, 
2005 
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X. APPENDIX C - RELATED PROCEEDINGS 

There are no decisions rendered by a court or the Board in any related proceedings 
identified above. 
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DECLARATION OF AUDREY D. GODDAJRB. Ph,D TTNDER 37 Cj.R. 8 1 .1 32 

Assistant Commissioner of Patents ■'. .' ■ 
Washington, D.C. 20231 \ .' . 



Sir: 



as follows: 



1. 



, Medical 



Affairs Department of Genentech, Inc., South San Francisco, California 940.80. 

. 2. . Berweenl993and2001,Ihe*dedmeDNAS^ 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic.process; ^ anddetemnnation 
of the chromosomal localization of novel genes. 

3, My scientific Curriculum Vitae,mcludmg my UrtofpimUcations, is attached to ari^ 
forms part of miiDeclaranon (E?diibit A).. ' •» 



■■ I . 4 ' 1 fyas ®* r ^ a ***** of techniqiira knoWi% e a?t for detecting and 
^tir^gtheamplffi^^ 

"gene ^amplification") assay described iki. a^ove ^ptfou^ patent applicaiioa 

5. The Tat^fan PCR assay is. deseribod, for example, in the following scientific : 
pubUcatioris: HigucM. et at, Biotechnology 10t41 3-4 17. (1992) ' (Exhibit Bfc.Livak * oL-PCK 
• Methods Appl,, 4:357-362 (1995) (Exhibit C)^nd Heid et Ml, Genome Res. ^986^994 (1996) . 
(ExhibitD). Briefly, the assay is based on the principle that success&ipCR yields a fluorescent? 
signal due to Taq DNA polymerase-mediated exonucle^e digestion of a; flubrescenfly labeled 
otigbnucleotide that ; is homologous to a sequence tietween two! PCR. primers. The extent of 
digestion depends directiyon^ 

increment m fluorescence that results from decreased e^^^^ This is an'extremely sensitive 

technique, which allows detection in the exponential phase of.thePCR reaction, and, as a result;' 
leads to accurate determination of gene copy number. ... . 

6. .The' quan^^rfiixiiescent' TaqMan. PCR assay has been extensively and 
successfully used to characterize genes involved in cancer ^^'dwelopment and progi^on; 
AmpUfication of protooncogenes has been studied in a variety of human timors, and is widely : 
considered as having etiological, diagnostic and prognostic signfficaoce/m use of the quantitative 
TaqMan.PCR assay is exemplified by the following scientific ^ publications: Pehnica e/.a£. Proc. 

,y«fl. Acad. Sci mk. 93(25)-i4717-14722 XWS) (Exhibit E);. . Pitti et a£ Nature 
396(6712);699,703(.19?8) (Exhibit F) andBieche et a/1, IhtXCancer 78^1^666 (1998X(Exhibit! 
G),the first two.of which l am co-author. parfc^^ quahtitative- 
TaqMan PCR assay to study relative gene amplification of WISP andc-mye:in various cell lines, 
colorectal tumors and normal mucosa. Pitti et. al r studied the genomic amplification of aidecoy 
receptor for Fas Ugahdk lung and colon cancer, using the quantitative.TaqMan PCR assay. Bieche 
et at used the assay to study gene amplification in breast, cancel 



■ i 



'■ : • v.* . 



/Serial No.: * 



7; It is iny personal: e>q>erience that the < 
..techmc^yse^vee^ 
control It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number hi a tumor tissue sampforelative to a normal (Lei, non W>4m P ie Vsignificani and: 
usefUmfo^ 

sample serves as abasisfor using relative gene copy number as quantitated by foeTaqMan PCR 
tecbjuqueas^daagnostocn^^^ 

pathology Accordingly, a gene identified as befog amplified at least- 2-fold by foe quantitative 
. TaqMah PCR assay in a tumor sample rejativefo a normal /sample is •useful as a marker for foe 
.diagnosis of c^ 



. ; : -8. I declare further aH statemenWn^e herein of niy dwn knowledge are true and 
all statemeins made o j ^ 

pumshablebyffoeormipri^ 

foe validity of the application' or any 



Code, 

patent issuing thereon. . 



Date 




Audfcey D. Goddafo£ PBJD. ! ' 



EXHIBIT A 



. . AUDREY D- GODDARD, fchJX ./ 



llOCongaSt ■*" 
San Francisco! CA, i9413t 
415.841,9154 =..-'".. ~ 
415.819^247 (mobile). . 
agoddard@pacbeli.net 



1993-present 

2001 -present; Senior Clinical Scientist. 
. Experimental Medicine /, BjoOncOlogy, Medical* Affairs . ; 

Responsibilities: ; [ [ 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genetiiech's clinical trials for irvnslational research 

• Translational research using clinical specimen and data for drug development and 

• diagnostics . j- "» 

• Member of Development Science Review Committee, Diagnostic Oversight Team 21 CFR 
Part HSubteam " .'< ' \ *' 

% Interests:;- *\ ■ % - * : •"' 

; Ethical and legal implications of experiments with clinical specimens and data . 

> Application of pharmacogenorpicsin clinical trials. V 

1998- 2001 Senior Scientist 

Head of the DNA Sequencing. Laboratory, Molecular Bjology Department, Research- . 
Responsibilities: ; " : . : ; ' ?> 

• Management of a laboratory of up to ntnetew-ihcluding postdoctoral fellow, associate 
scientist, senior research. associate and research assistants/associate levels 

« Management of a $750Kbudget 
- • DNA sequencing core facility supporting a 350+ person research facility. 

> DNA sequencing for high throughput gene discovery, - £STs; cDNAs, and constructs , 
: * Genomic sequence analysis and gene identification 

/• DNA sequence and primary protein analysis 

'Research; 

« Chromosomal localization of novel genes 

• Identification and characterization, of genes contributing to the oncogenic process 

• identification *and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 



Genenfech, inc.. . y'. . X\ 

; 1 DNA Way /~ ' . 

.South Sari: ^iFraricisco, CA, S408Q \ 
65.0.225:6429; ... 
gpddarda@gene.com i. , . : : 

PROFESSIONAL EXPERIENCE 

Genentech, Inc. 
South San Francisco, CA 



:U. .-. 



\ \v ; * f-' Gpddird r Ph.D. ,V.page2°of ?* 

1993 - 1998 ' ;.i Scientist ' * y \ ... V, ..- m \ - : ' . . '/ 

Head of the DNA Sequencing Laboratory, Motecuiar Biology Department, Research " . 
; Responsibilities ' / ■• ',' .v. , ';\ 

+ DNA sequencing, core facility supporting a 3^0+ person research facility 

• iAssumecf responsibility for a pre-existing team qf five technicians and expanded the group 
s info fifteen,, introducing a /eve/ of middle management and additional areas of research ; 

• Participated in the development of the basic pfan for high throughput secreted protein V 
. discovery program- sequencing strategies, data analysis and p®cking,. daiabase 

• High throughput EST $nd cDNA sequencing for new gene identification. / 

• Design and implementation of analysis tools required for high throughput gene identification: 
+ Ghmmos.omalfocalizaiion 

Research:. • .\f" " •*".."•'-.'* -v ' .". -.' ' ■ ' \ : 

• Genomic sequence scanning for new. gene discovery. \ \ " • . 
> Development of signal peptide selection, methods. 

• Evaluation of candidate disease genes. ... 

• Gmwth hormone receptor gene,SNPs in children witii idiopathic ' 

Imperial Cancer Research Fund . \ 1989-1992 

London, UK with Dry Ellen Solomon . 

6/89 -12/92: Postdoctoral Fellow 

• Cloning and characterization, of the genes fiised at the acute promyelocyte leukemfer » 
translocation breakpoints on chromosomes 17. and 15. : , 

• Prepared a. successfully funded European. Union multi^center grant application 

McM aster University 1983 
Hamilton, Ontario, Canada with Dr. G, D, Sweeney 

5/83-8/83: NSERC Summer Student : 

• In vitro metabolism of p-naphthoflavone in.C57Bl/6J and DBA rtilce 



EDUCATION ; 

-Thenotyp^ • . 

the human retinoblastoma gene." ... : 1959 

Supervisor: Dr. R^PhiSps : - ! b^!^ 

Honours B.Sc McMaster University, 

"The in vitro metabolism of the cytochrome P-448 Hamilton,. Ontario, Canada- 1983 

- inducer p-naphthoflavqne in C57BU6J mice.- Department of Biochemistry 
Supervisor: Dr- G. D. Sweeney. . : 
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ACADEMIC AWARDS 



Imperial Cancer Research Fund Postdoctoral Fellov^hip ; ... ^ 198^1992 

. Medical Research Council Studentship ; r1983-1988 

NSERC Undergraduate Summer Research Award . ig83* V 

'Society of. Chemical Industry Meiit Award (Hons. Biochem.) . / - . 1983 . 

Dr. Harry Lyman Hooker Scholarship : ^ . ' . . ; - . . 1981-1983 

j;LW: Gilt Scholarship : .;i981-t982 

Business and Professional Women's Club Scholarship ' .1980-1981 : 

Wyerhagser Foundation Scholarship > 1 1979^1980. 



invited presentations " V 

. Genentech's gene discowiy pipeline: High thrbughputident^ 

characterization of. novel genes, functional Genomics: From Genome to. Function Litchfiefer 
. Park/AZ, USA. October 2000 \ / ; ^ , . 

, High throughput identification, cloning and characterization of novel genes, G2K:Back to 
Sdenice, Advances in Genome Biology and Technology I. Marco Island; FL 'USA, February 
2000' *: • : _ . /■ " 

. Qfiiality. cpntrbl in DNA Sequencing: The use of Phred and Phrap. Bay Area Seq uencing 
. Users Meeting, Berkeley, CA, USA, April! 1999 . . 

High throughput secreted protein identification and cloning, .Tenth- International Genome 
: Seqpjencing and Analysis (inference, Miami, FL, USA v September 1998 

The evolution of DNA sequencing; The Genehtech perspective; Bay Area Sequencing Users 
Meeting j Berkeley, CA, USA. May 1998 

Partial GrowtK.Hormone insensiti\^.Therole of GH-receptor mutations/in Idiopathic Short 
Stature, Tenth. Annual National Cooperative Growth Study investigators Meeting- San 
Francisco, CA, USA. October, 1996 ; : : . 

.. . Growth, hormone (Gtfj recep&f defects are present in selected chiltdUren with norv-GH-deficient 
: short stature: A molecular basis for partial GH-insensjtivity.; 76 th Annual Meeting of The 
Endocrine Society; Anaheim,. CA, USA. June 1994 . -, V 

." Aprevibusly unc^aracterized gene. myl. is fused to the retinoic acid receptor alpha gene in 
acute promyelocytic leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Itaiy, October 1991 



:■::?;■■:■:<>: -\ .. O. • .'v • .. :'•.•;•-'■.:•••:-£) -V*V:-- 

• c ' PATENTS V/ .' V'-,V .•; ./v'. • . . : '•' ']■ •'••V-\ " '•. "' 

Goddard A.Gafowski PJ, Gumey AL NL2 Tie ligand homologue polypeptide Patent 
Number 6,455,496. Date of Patent Sept 24, 2002. - . 

Goddard A, God'owsW PJ and Gumey AL. NL3 Tie ligand homologue. nudeic. adds • Patent 
•,• Number>.6 f 426,218. Date of Patent July 30; 2002.^ . . . ; . 

'■: '. . Godowski. P, . Gurney. A Hillan KJ; Botsteiri D, Goddard A Roy M, Ferrara N, Tumas D 

Schwall R NL4 Tie. ligand homotogue nudeic add. ..Patent Number. 6,4137 770 Date of 
- . . . . . : Patent July 2, 2002! • ; . . 

"'.V- "•' ' ■ \ : AshkenazfA Fong S, Goddard A Gumey AL, Napier MA Tumas D, Wood Wl. Nudelaacid' 
^ encoding A-33 related antigen poly peptides. Patent Numben 6.4.10,708 Date- of Patent- 

. • • "" . Jun. 25, 2002.. : / . . \ , ; - 

Botsteiri. DA Cohen RL, Goddard AD, Gumey AL. Hillan .KJ, Lawrence DA, Levine AJ 
Penriica D.Roy MA and Wood Wl. WISP polypeptides and nudeic adds encoding same' - 
*; -: • Patent Number. 6,387,657. Date ofPatent May 14 r 2002i . . . ; r 

. Goddard A' Godowski. PJ and Gurney AL. Tie iigands. Patent Number: 1 6.372:491 Date of ^ 

' Patent April 16.2002. v . • 

• - Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody Patent 

Number 6,350,450. Date of Patent Feb. 26.2002.. 

v • Fong: S, Ferrara- N, Goddard' A Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 

. receptor tyrosine kinase ligand horfiologues. Patent Number 6,348,351. Date of Patent ' . 
Febl.19, 2002. •> : . 

Goddard A Godowski PJ and Gumey AL Ligand hbmolbgues. Patent Number 6,348 350 
Pate of Patent: Feb, 19 r 2002. . . , -. . • 

':. -. Attie . KM, : Carisson LMS, Gesundheit N and Goddard A . Treatment of partial- growth 

hormone insensith/ity syndrome:. Patent- Number 6,207,640: . Date of Patent March 27 
" . 200.1... ••• . .•' . ' ' ' : * • •*•* 

Fong S, Ferrara N, Goddard K Godowski P J, Gumey AL. Hillan K and. Williams PM: Nudeic ^ 
adds encoding NL-3. Patent Number. 6,074,873. Date of Patent June 13, 2000 

. . Attie K, Carlssbn. LMS, Gesunheit N and Goddard A Treatment of partial growth hormone 
ihsensitiyity syndrome; Patent Numben ^5,824.642. Date of Patent October 20,. 1998 

.Attie K, Carlsson LMS. Gesunheit N and Goddard. A Treatment of partial growth hormone 
: .. insensitivity syndrome. Patent Number 5,646,113. Date of Patent: July 8,1 997 I: . 

• • ' *' : Multiple additional provisional applications filed 
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f _ Seshasayfee p, Dawd p; Gu G, Ericfcson Goddard AD" Comparative sequence ahalysis ot ' 
t^ HER2 locus in friouse and man;. ManusOTpt in; preparation. 

: Abuzzahab MJ; Goddard A, Grigorescu F, Lautier C, Smith RJ and ChemaUsek SD.. Hiiman 
rfGP-1 receptor mutations resulting in pre^ and post-natal growth retardation. Manuscrjptin 
/preparation. . - 

. . Ajggafwal S/Xie; M-H,.Foster J, Frantz G.Stinson J.Corpuz RT, Simmons L r Hif lan K, 

Yansgra DG.Vandlen RL; Goddard AD.and GUrney.AL. FHFR, a novel receptor for the 
.\ , fibrtibtest.growth factors; Manuscript submitted. V ■■ 

Adams SH/Chui C t Schilbach.SL, Yu XX, Goddard AD, Grimaidi JC, Lee J, Dowd P f Cbftnan 
S. t Lewin DA (2001 ) BFIT, a unique aeyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of. the human gene, and assessment of a potential link 
to obesity^ B/o<^^ . . %:\ \ 

Lee X Ho;WH. Maruoka M. Corpuz RT. Baldwin DT: Foster JS. Goddard AD. Yansura DG. - 
Yandlen RL- Wood Wl. GutrieyAL:(2Q01) IL-17E. a novel proinflammatory ligand for the IL- 
.17 receptor hbmolog IL-17Rh1 . Journal of Biological Chemistry 276(2): 166<M664. O 

; Xie MrH, Aggarwaf S, Ho. W-H. Foster j, Zhang Z, Stinsdn J, Wood Wl, Goddard AD and 
' Gurney AL (20DO). Interieukin (jL)-22, a novel human cytokine that signals throughthe 
... Jnferferon-receptor related proteins CRF2-4 arid IL-22R. Journal of Biotogicaf Chemistry 275- 
; '31335-3133Q. /■ . . ./ I'./. / : ; t . * 

Weiss GAi Watanabe CK, Zhong A, Goddard A arid Stdhu SS: (2000 j Rapid mapping of 
. protein functional epitopes, by combinatorial alanine scanning. Proc. NatL Acad. Sci USA 97 : 
895(^8954. . . ; ; " ... 4 : - . 

Guo S; Yarpaguchi Y, Schiibach S, Wada t 7 ;LeeJ t Goddard A Frenc* D , Handa B, 
Rosenthal A, (2000) A regulator of transcriptional elongation controls vertebrate neuronal 
. development Wafure 408; 366-369. .■ 

Yaci lyi, Wang L-C, Hymowitz SG, Schiibach S» Lee J, Goddard A, de Vds AM,Gao WQ, Dixit. 
,YM. (2000) Two-amino add molecular switch in an epithelial morphogen" that regulates 
binding to.tvi^ distinct rera^ 

_ SeW PD- Tai JTN; HillanKJ, Brovyri fc£. Goddard A, Yang R, Jirv H and Lowe DG. (2000> 
Application of cDNA.microarrays in determining molecular phenotype in cardiac growth, 
, development, and response to injury. C//icuMon 101:1990-1999. 

Guo S, Brtish J, Teraoka H, Goddard A*. Wilson SW; Mulliris MC and Rosenthal A. (1999). . 
Development of noradrenergic neurons in the zebrafishhiridbrain requires BMP, FGF8/and 
the homeodomain proteiri soulless/Phox2A.Neunw 24: 555-566^ ; 

; Stone D t .Murone, M, Luoh v S, Ye W, Armanini P;. Gurney A, Phillips HS, Brush. J, Goddard 
. A,.de Sauvage FJ and Rosenthal A.. (1999) Gharacterization of the human suppressor of ■* 
fused;" a negative regulator of the zinc-finger transcription factor Glf. J; Cell Sci: 112: 4437- 

. .MM- *: 

■\ Xie M-H f >lolcomb I, Deuel B, DowdP, Huang A; Vagfe A,.Foster J; Liang j; Brush J, Gu Q, 
Hillan K f Goddard A and Gurney, A;L (1999) FGF-1 9; a novel fibrQblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735.. 
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Van M,.Lee J. $chnbach : 5,.Goddard A and Dixit V. (1999) mE.10, a novel .caspase- : '" - 
recmitmentdomain^ntainihg-proapoptotlc rnolecuje. J. BiotChem. 274$ 5): 102&7.-i02?2. 

GUmey AL I; Marsters SA, Huang RM,.PittittM;Mark DT, BatcfwWDT; Gray AM Dowd P 
Brush. J. Heldens S ; Sehow P, Goddard AD, Wood Wl, Baker KP, Godowski PJ and l " 
Ashkenazi A (1999). Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog.of moiise G\11<:CumntBiblogy:^4):.2i5^2^8. 

Ridgway JBB, Ng. Ei Kerri jA ,Lee J, Brush J, Goddard A and' Carter P; (1999) Identification ! 
of a hurnan anB-CD55 single-chain Fv by subtractive. panning of a phage library using tumor 
and nootumor cell lines. Gancer Research 59: 2718-2723; .-.•.„. 

I PitH RM, Marsters SA, Lawrence DA Roy M, Kischkel FC. Dowd P, Huang A Donahue CJ 
.Sherwood SW, Baldwin DT, Godowski PJ; Wood Wl, Gumey AL; Hillan KJ Cohen RL 
Goddard AD, Botstein D and.Ashkenazi A. fl998) Genomic amplification of a decoy receptor 
foc Fas ligand Jn lupg and colon cancer. Nafure 396(671 2): 699-703. : . f : 

Pennica D, SwansbnTA Welsh JW, Roy MA Lawrence DA Lee J, Brush J. Taneyhill LA 

• Deuel B.Lew M.Watanabe.Ci Cohen RL, Melhem MF. Finley GG, Quirke P Goddard AD : 
Hillan KJ, Gumey AL, Botstein D and Levine AJ. (1998). WlSP genes are members of the ' 
connective tissue growth factor family that are up-regulated in wnVI-transformed cells arid^ 
aberrantly.expressed in human, colon tumors. Proc.Natl. Acad: Set USA: 95(25>; 14717- 
"14722.' • • • .' ; -. ' .' •■ •!"-;' '•' 

Yang RB, Mark MR, Gray A,. Huang A Xie MH, Zhang M, Goddard £ Wood Wl Gumey AL 
and Godowski PJ. (1998) TolMike receptor-2 mediates lipopolysaccharide-induced cellular' " 
signalling. Nature 395(0699): 284-288. . . . • \ ! . 

.Merchant AM, Zhu Z,. Yuaa JQ; Goddard A, Adams CW, Presta LG and Carter PS (1998) An 
efficient route to human blspecific IgG. Nature Biotechnology 16(7): 677-68.1 . 

Marsters SA, Sheridan J P, Pitti RMi Brush J, Goddard A and Ashkenazi A (1998) . , v 
Identification of a Ikja'nd for the deaft-dOmain-contaihing receptor Apo3. Current Biology 8(9):- 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM; Lam CW. Hynes M, 
Goddard A, Rosenthal. A. Epstein EH Jri andde Sauvage FJ. (1998) Activating Smoothened 
mutations In i sporadic basal-cell caranoma. Afafure. 391 (6662): 90-92: 

Marsters SA, Sheridan JP, Pitti RM, Huang A Skubatch M, Baidwln D- YUaii J Gumey A 1 

• Goddard AD, Godowski P and. Ashkenazi A. (1 997) A novel receptor, for Apo2LfrRA.IL ' . 
contains a . truncated death.domain. Current S/o/bgy. 7(12): 1003-1006. : 

Hynes-M; Stone DM, Dowd My Pitts-Meek S; Goddard A Gumey A and Rosenthal A (1997) 
Control of cell pattern in .Hie neural tube by the zinc finger trahscription factor Gli-1 Neuron 
-19:15-26:'. •.'•'. '. ; ••■■ . ■.' . • 

Sheridan JP, Marsters SA, Pitti: RM, Giimey A., Skubatch M. Baldwin D, Ramakrishnan L 
Gray CL, Baker K, Wood Wl; Goddard AD, Godowski P, and Ashkenazi A (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and- Decoy Receptors. Science 277 
. (S327)i 81S-821. . 
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Goiddard AD; Dowd P t Cherriai^ek S, Geffner M, Gertner J, Hintz R, Hopwood N f Kaplan Si 
Ptotnick L t RogoF A; Rosenfieid R, Saengef P, Mauras N, Hershkopf R r AnguloM and At8e, K. 
(1997) Partial growth hormone InsenslBvity: The role of growth hormone receptor mutations in 
JdiopathiG shorts^ - ; 

. tiein RD, Sherman D.HoWH, Stone D, Bennett GL, Moffat B; Vandlen R,. Simmons L, Gu Q, 
HcmgoJA/DeyauxB, PoQlsenX Annam 

Henderson CE ; Takahashi Mand Rosenthal A. (1997) A GPMinked protein that interacts with 
Ret to.form a candidate nedrturin receptor. Nature. 387(6634): 717t21 . 

Stone DM, Hyries Mi Anmanihi.M. SWahson TA, Gu Q; Joiinson RL; Scott MP, PennicaDV 
V Goddard A^ Phillips H f . Noll M, Hooper JE, de. Sauvage F and Rosenthal A (1 996) the . 
. toirour-suppressor gene patched: encodes a candidate receptbrfor Sonic hedgehog Nature 
.'384(6605): 129-34.; • . 

Marsters SA, Sheridan JP, Donahue CJ, Pitti RM,.Gray CL, Goddard AD, Baiier KD and 
Ashkenazi A. (1 996) Apo-3, a new member of the tumor necrosis factor receptor family,. - 
contains a death tfomairl and activates, apoptosis and NF-kappa pi Current Biology 6(12): 
1669-76. • ■ • ... •■*».:.*. 

.. ,Rothe M* Xiong J r Shu HB,. Williamson K, Goddard A and Goeddel DV. (1996) l-TRAF is a 
' novel TRAF-interacting protein that regulates TRAF-mediated signal transduction. Proc.Nati 
. Acad ;Sd. USA 9S: 8241-8246. . . . . [. 

-YangM, Luoh SM^Goddard A, Reiily D, Hi^nzel W and BassS. (1996).Theb9lXgene- 
located at 47 .8 min on the Escherichia coli chromosome encodes a:periplasmic beta- 
gluoosidase. Wifcrab/o/ogy 142: 1659-65. 

: Goddard AD and Black DM. (1996) Familial Cancer Fn Molecular Endocrinology of Cancer. , 
; Waxman, J. Ed. Cambridge Uhwersity Press; Cambridge. U^pp.l 87-21 5. 

Trearior JJS, Goodman L,.de Sauvage F, Stone DM, Poulson KT.Beck CD, Gray Q; Armanini 
MP, Pollocks RA r Heffi F ( Phillips HS, Goddard A, Moore MW, Buj-Beilo A, Davis AM, Asa! N , 
Takahashi M, Vandlen R^ Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF; Nature 3&2: 80-83. ■ \ 

"v Klein RD, Gu Q, Goddard A and Rosenthal A: (1996) Selection for genes encoding secreted 
proteins and receptor?- Proa NatLAdad.ScL USA 93: 7108-7113. ■ . a * 

Winslow JW; Morari P, Vatverde J; Shih A, Yuan JQ; Wong SC; TsaLSP, Goddard A Henzel 
WJ,. Hefti F and Garas t (1 995) Cloning of AL4, a ligand for an Eph-related tyrosine kinase 
receptor involved in axon bundle formation. Neuron 14r 973r981 . 

Bennett BD, Zeigler«FC, Gu Q, Fendly B; Goddard AD, Gillett .N and Matthews W, (1995) 
Molecular ptonihg of a ligand for the EPH-related receptor prdtein-tyrosine kinase Htk. Proa 
Natj: Acad. Scl; USA 92: 1866-1 870. . ; ; V 

HUang X, Yuahg j, Goddard' A, Foulis A,. James RF; Lernmark A, Pujol-Borrell.R, . 
Rabinovitch A, Somoza M and Stewart TA. (1995) Interferon expression in the pancreases of 
. patients with. type l diabetes. O/afietes 44: 658^64: ' . .. 

Goddard AD, Yuan JQ' Fatrbaim L, Dexter. M, Borrow J : , Kozak C and Soforhon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
. 6: 732-737. r \ • 
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Goddard AD, Covello R, Luoh SM; Gbckson T, Attie KM f :Gesiindbeit N t Rundle AC, Wells 
JA.Cartesbn LMTI and TheGrowth Hormone Insensitivity Study Group. (1 995) Mutationsof 
the growfti hormone receptor in children with idiopathic sho 
.1093-1098.; ; . w'.'. \. ;■" y J. ' - . \ * ' ' 

. Kiio SS f Mbran P, Grippe Amianini Mj Phillips HS,. Goddard A and Caras IW. (1 994) 
Identification and charaderization of Batk, a predominantly brain-specific non-receptor proteia 
- tyrosine kinase related to CsIC J. NeuroscL Res. 38: 705-715. 

Marie MR, Seadden DT, Wang Z,. GuO, Goddard A and Godowski PJ. (1994). Rse.a novel 
receptor-iype tyrosine kinase with homology to Axl/Ufov is expressed at high levels in the • 
brain. Journal of Biological Chemistry 269: 1 0720*10728. : : ". \ «* " • - 

-Borrow J; Shipley J/Howe < Kiefy F, Goddard A, Sheer D, Snvastava A, Antony AC;* 
Fioretqs T f Mitelman F and Solomon E. (1994) Molecular/analysis of simple variant ' ' . 
;tran?locations in acute promyelocytic leukemia Genes Chromosomes Cancer & 234-243. 

Goddard AD and SOloriidn E: (1993) Gepetips of Cancer. Acfvi Hum- Genet 21: 321-376. 

{Borrow J, Goddard AD- Gibbons B t Kaitz F, Swirsky D, Fioretos T, Dqbe i; Winfield DA; ' ■ ' . 
Kingston J, Hagemeijer A,:Rees JKH..Lrster ATand SbiomorvE. (1992) ; Diagnosis of acute " : 
{promydocytic^ . 
transcripts; Br. 1 Haematol 82: 529-540. * . . ;■ . ; * 

Goddard AD, Borrow J and Solomon* E. (1992) A previously Mncharactenzed gene, PML, is 
fused to the retinoid add receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
s^pU^ti7S-it9S. v ; . r>:J;\\:y/\ : 

. Zhu X, : Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and Gallie BL (1992) 
Mechanisms of Joss of heterozygosity in ^ retinoblastorng. Cyfogenet CeC Genet 59: 248-252. 

Foulkes W, Goddard A.' and Pafel K. (1991) Retinoblastoma linked with Seascale [tetter] 
; British Med; J. 302:409. ;V/* : . \ . ' v : "•,/ 

, Goddard AD„ Borrow J, Freemont PS and Solbmbn E. (1991) Characterization of a novel zinc 
finger gene disrupted by the t(1 5;17) iaacute promyelocytic leukemia. Science 254: 1 371- 

• 1374/; . . .r- -v..:"-;. . m 

"Sofomon E t Borrow J and Goddard AD. (1991) Chromosomal aberrations in capcer. 'Science 
254; 1153-.116ti; ; ^ . -# . *, .r;./"? 

Paiunen U Jones f A; Goddard K Sheer D, Solbmbn E, Pihlajanfemi T arid KivilikkaklV • .' 
. (1991) Regional assignment of the human gene coding for a. multifunctional peptide (P4HB) 
acting, as. the p-subunit of pro!ylr4-hydrpxylase and the enzyme protein, disulfide. isomerase to 
17q25, Cyfogenef. Cell: Genet. 56: 165-168.. 

Borrow J; Black DM, Goddard AD, YagJeMK, Frt^ehauf A.-M andSolomoaE, (1 991). 
Construction and regional localization of a Wpfl: linking library from human: chromosome 17q. 
GenomicsA 0:477-480. 

Borrow ^" Goddard AD, Sheer D and Solomon El- (1990) Molecular analysis of acute 
. promyelocytic leukemia breakpoint duster region on chromosome 17. Science 249:1577- 

.1580. * 1 : * ' ' - ■ . ■' - ■ • 
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Myers JC, Jones TA, Pohjolainen E-R, Kadri AS, G^ 

Pihlajaniemi T. (1990) Molecular cloning of. 5(IV) collagen and assignment of the gene to the : 
regtortof the region of the X-chromdsome containing i the Alport Syndrome locus Am J Hurrt 
Genet 46:1024-1033. ; f '. ' * * .. ■". 

Galiie BL, Squire JA, Goddard A, Dunn M, Canton Mi Hintori:D,.Z*»uX'and Phillips RA. 
(1990) Mecrtanisms of oncogenesis in retinoblastoma. Labi irivestSZ: 394-408. . 

Goddard AD, Phillips RA, Greger V;;Passarge E, Hopping W,^ Galiie BL and Horsthemke B 
(1 990) Use of the RBT cDNA as.a diagnosticprobe in retinoblastoma families. Cffn/caf • 
Ge»efics37: 117-126.- ;. /-\.- ." •''■} .;• ' ; ..7- ; \.\ 

Zhu XP,.Dunn ;JM, Phillips PA Goddard: Ap, Paton KE, Becker A and Gallie BL (1989):- '. • 
Germline, but not somatic, mutations of . the RB1 gene preferentially involve the paternal 
allele. Nature 340: 31 2-314. . . .'\ ; v" •"»•;•>■.■ T 



Galiie BL. Dunn JM. Goddard A Becker Aand Phillips RA (19.88) Identification of mutations 
in the putative retinoblastomagene. In Molecular Biolodv of. The Eve: Genas. Visionand . 
Ocular Disease. UCLA Symposia onMoiecular apd Cellular Biology; New Series, Volume 88 
J. Piatigorsky; T. ShinoharaandP,S. Zelenka,. Eds. Alan R Uss, Inc., New York; 1988 dd 
"427-436..' .''•.:•:*,>'.•:•'*..••'••!':• ' ' : •' : " ' . " "' . .:-/*'"••..••" 

; Goddard ADv Baiakier Hi Canton M. Dunn J, Squire j; Reyes E, Becker A Phillips RA and 
GaBie BL; (1988) Infrequent genomic rearrangement and normal expression of the- putative 
RB1- gene in retinoblastoma tumors. Mot Celt Biol. 8:.2082-2088'./ ; . 

Squire J, Dunn J, Goddard A Hoffirrian T, Musarella M m Willard HF,. Becker AJ, Galiie BL and 
Phillips RA. (1986) Cloning-of the esterase D gene:. A polymorphic gene probe closely linked 
to the retinoblastoma locus on chromosome 13. Proc. Nati. Acad. So. USA 83: 6573-6577. 

Squire J, Goddard AO. Canton M, Becker A, Phillips RA and Galiie BL (t986) Tumour 
induction byUie retinoblastoma mutation is independent of U-myc expression. Nature. 322- 
555-557. • • 



of bilateral retinoblastoma patients .as determined by. micronucleus induction in vitro; Mutation 
Research 152: 31-38; ' ••; ; .' • 
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"of t^hH^jdbiiv .Wiii^h d^n^ oii 
probe! T^/i^^ struct. 



: pidft*.!^ ink * 



ture iir probe .t^glat^ a^gafihg 



. . bdund : probe l^^.S %^ 'ani 
q^ien^et iiyes. TCik/j^ea^ge is d^er*- 

*tvhtkv j^^^A^t^capi^ as showp 
. tfe$ o&^ 



probed- 

, Tlie dse In RQ~ values <for : the Afcstr- - 

: : i4^e djf. ^^cfiii^ "lis ^4u^d>^jo^ 
. y&at as -fte quezt&er & placed ■ toj&aict 
the j'fendvraVlpw^st ^p^^t^u^a^i- 

3) ^et th^ for the pr<Dbe ivhexe ate 
. :^aMjia: vis tue 3? xAJv26) v Tkite i$ 
iflidtot^dab^ 

beiess;*^^ ; 
of an liit^^> jk^fl^ In ifeffecti a 
qS^cft^: ^ th^ ^^d^s freer !ta^6g^ l 




■ 3f :pj6ie is ; l&i~ihaxi foj the Mteniiliy : 

V-ihat , '^Idfis/tfie: V&tte Air 

^Ajt^^^ ef- 
• . ; ^&bi^i |^e : Qia^ l>e .a jmodest . 

,prob^Ieay^ge ? ^Ktod^dr^ 

pr0^ f iS^is great^iioir the 3 1 . 

^^•Tiwa^idt the 3^nd^mor^lik^y 
5 ^'Qiter: and 
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gfcne; which if airipUn^d- in ' &f ,&teast 'tiji-; 



quantitative analysis of riuyetc^atitf sequels 
#tbthfor^ 

al. i9SQ; Thomas 1980), frecerfoly, PCfc'has 
proven to be a powerful tool, for quantitative 
nucleic add/anatyys. PGR arid* tfeverse transcrip- 
tase (RT)-PG& have- peritiitted tire arfcivsfc of. 
mjntoal staritin^ quarititi^ of nucfeic acid^as 
tittle as one £el] equiYaleRtji lfofc%a$ otade pos- 



^perfQrmed -vtfth ^aai«q'rtal -,teebh6dfe. Altrtfogh 
.P<5k has provided a p^werM tooJ/ftts jmpmtWe 



E-MAIL' mi3<.cy^gcn< c<>m; FA£- (41 S) 22s5;,t.4Tl . 



.^it; itj# ^ 

of^ikecs i# ; S). Itf ari^ eafrly. &$£i$s of quanta* 
- qve 'fcfc^ar^^ 

tr^^^ 

fer^fe^ 
upjrol^^ 

\^ Gt4nQ€fhfb -*jt at r 



of gaalatltatlv^ andtflT^plpft. ^iie^pp^adi 



159% « RaHg et at i$90). fhi$ ^ .m^thbd; teqiites 
*that Va^tf -sai^pie jias equal; input?. A^uftH df 
ftu^it atldafid thai^each sa^Ie u^ca^alysfs 



ttfi^/;^ used 4bj:'. sanifeie 

a^p^^atio^ effictejicy noxitjalizatioa. $«ng 
:c^p^hil0.na] ^etjiod^ot fcCR ^ d^tejiH^; va^d 



re 



hy^ridi^yoti), it is e*?r<em'$l^ 



•h6^|l1^^d:^|ie),. Anther rijethc^, :)patilita. 
tfye eoittpemiye 0?C>-PCR; has beert dj^dped 
ancjfis^s^r^ 

relies <jn the indusipn of an irritemaj <c<Jf|trdl 



re- 

^atf toh> Is pgilli^ (iprtrte irijferiial tou 
A &iow*v arrjpujit of iitf 



. . . f). 
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•?ap>ple throughput: of ^^tB#ie^^o&-vte*Wi?^. 

.•McsiprfdJaicalttJafc). '., • 

:-1M**t$to* the a^»pm^t fi f a. ^ 
.^say: fet-.qjian^fattv« ^©«Avafiajys«~thea$&y:fe 
based . «MHp use pf they^ jvueWaS*: assav/^ist 
dese^bed^ Spiled # afcOmfc Tfe^etHbtf ■' 
»ses<heSr..mi^easfi At^jpU^mfe 
cleave a ntmectmdible ti^riti^ito p>t*e *fc ■ 

« : an lS9Sa;B)f; On* fluoresi&nt .djre serves as. a 
reporter (F^M Qje., ^Tca^yfluotWceiitjiand its 
«^iMic!n sp^tra =is;qpettched by the-se^MBtib- 
^«HPA (i.e- ( 6-carb6xynetramethyl- 
: rtV^arolqfe), the nueleaSe^adatfehof ,t«e%; 
toridfeaUoH probe releases the g^tog af ^ 
^AM flddresqent emission, ^U)tttg: in ati ifr, 
erease:fe:peak Ifo^*^..^^^^ 
TOe.use of as&iuence de^^^^ 
:^^«^^bffluo«sceMspj^^f^^ii. 
of the thermal-cycler eottphwousfc . -the . 
*CR amplification, ^refor^e. reactiotti 
Wnitorfed in /eal time, ^^puVdita .W-de 
*c$bed ; atttf^qtftatlvje ^na^is^tnptit^rfee* 
BNA sequeneesTs discussed feeiow. •". ' ; ":' 




Mate YABI Prism $1triiei*Z.A-&. : 




djg^po^^^t}^, ^pther,a/riuehefr 

:&m* *^:-m*is&f c^^sto'd: Reporter 
.^;flu^e?^tvetnfeio^ abtfor^^fhe 




toe P*ob*th*:^ 




MdW|^y©^i| l ^ 0 ^v^^'ai.'" W Ke per- 
fo.c^ed,to. chW0sej.tHe appropriate^ p/obe and 
magnesium concentrations yieldlnWttie Whest 
Ititenslty of jepprtet- fluoMcerit'^aJ-wiihout 
»«rfi ? ing;sp-eci.fleity:; The> instfUiSi^m.^es'a 
cb^cogpted deV.i W - <pGb*afoeraj for 

seq^entrajlyror 25" msec ^itb <«ri«nubus j«om- 
tOrmjg^hfoughput- ffo arfrpHffcatten. %<;ft^ube 
was rc^a^ined ^veify. 8."$ secf. ^smptfter sbft-V 
ware- vas deigned to ^ine ; the:iEi«6f^ent in- 
: tensity, of bat^ ,,the rppottfir d^e .(PftMi and 
the wenching dye iTAM^A). The ,fliiOrescent 
inten^yaftbe^Uftn 

very iltUe ovet the coarse Of. the feCR aoiiillfi^ 

Sgr ShbWrt)i ^eWe/'tbe:fhtens»y 
of J^MRA; dye emission serves as aij iritetei 
«a P da^;with.is*[% , t 0 oOrnfaifee tbe feporter 
dye if^mm^^ma^-vi^^-^i. 

tion. nfee, WdVRV- emission. : intens%)^ of te. 
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FFgure 1 RCR. product deftec\iort:in.raalJtjfx<ft.. £4)* TheiMjodel /Z&Q software will construct amplication pilots 
viation Is dttermifted from "tfte M fta jjte\ri&^^ 



pbihtS' reftr&em the insert 4>f ^l^te f GR- ^JlJtea^orTs; p^d^rbr a re staswti ^at ^pt dWkys >tstt)fejii 
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^W^fhe^lMKfi^ipJots shift fo 
d^jradp #>dugh> probe to ^aiirtfc* v : v " ■ 

tB does not.reftect thesame fiudt^ehf:rate. ol 
mcte^hibitedbym^tpfth? othW.snt>pj«. 
TOe IS-ftrng sanuple ^J^ achieve ertdp&ritW 
feau at a Wwer fluorespenf value th<rn Woirtd I be 
expect^bwfed on ftt jftpat BNa. thiifD^*. 
■ enort has been observed occasionally *fch o'thef 
samples (data not shown) a^tf^y ^amibut. 

^^• a 1 J ^ ^ earlyptaftauqdno. 
impact » g rtif lG aritty. the calculated G T value as 
demomttated by the fit on *he line shown " 
„£f B . 1 ^ A, 'trtplicate amplifications resulted in 
very, similar Cr v a lu«-the .standard deviation 

mni^^z>m m , m . n ^ ofinputtar- 
-jjtt molecules. Using Q Values foV qUanStatioh 

using total ;flupj*scent emission intensity for 

tensity increment of the ABipVism 77QD & 
quence De^toionly spans; tfiree logs, restfLg 

eeules. Thus, C, Va'Iue^. pi0 ylde accuse nteasute- 



a|by4.facto^^^^ ■ 
•Rr^ifatio^s ^%jj'mi„^i j Afcefe&r&rn^ik 

Iicate. : ^i6.^ : ^i^ 

,qua rt ,UaMv6;PCR,anal^. <^m.pamo& {rf 
nttions.also showed jrai«imal Va^bii^ tojil^. 

P^vnteto^xny-tf th6 . sam p fes ^ (J^ 
akd.0.71 : ^F the 1QS arid 25. ng satrtpj^ 'fespec, 

^tnpleve^ibitfta. ft e^Kjaietlt rafeS RuorS 

^opes=4er.ved&om the saffipfediluttdn S (Fig ^) 

tor -would exhibit a .greatet measuf«l ^ aC cin C T 
value Tor a-giveb : quantity of 0N A; in: addition 
•he inhibitor would be diluted' ^ 
s?mpi^, 0 the^Uution ana^ (% ^/^W 

_«! ^^J^tip»yieldedva^m.Slarr&ultJn .^ajialysifc 
^J^ghly ^0du^ ^,rfe>b 

Quantitative Analysis of ia.Plasmld After 
TransiebtTransfettibn 

m celis w^e,tranSi^nay trdnsfect«.d with-* vet- 
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4or comaiXiirig: a partial c6NA fox huTxian fa.GtQr. 
Vil!, ,pF8TM. A s&rier of tr^sf^ : ipps..was ^et 
up : M$ing;:a deceasing amount of tbf^I^^i^(40, 
4> O.S; and 0.1 .^g). Tw.eniy^fpar hour^ pp5t- 
trajisfectipn, total DN A was purified i.fteim each. 

: : nask<if Ceils. ft^Actin genequaritjity wXsc.h.osen as 
a Valiie. for hor<nalizatiqn of genornic t}ti'X con- 
centotioh ffOm each sa:mple:;In;fhrs^x 
(i^ctln. gene qont^nt . should, remain eonstant 
tefafivetotbtal.geftomlc.DNA^^ Hgu're 3^shov/$ tfee 
<£srit o? ng 
iotal DHA* .determined by ulkaViqtet- spe^ros- 



in -tictpdicate the mean pT&tjji of 
.th:€( ; iriphcit«w 

one Standard deviajtion). tfjfe !higjfc^:ai^ 



b^weeii any ^ two 5ftm^le. means Was^SCt,. Ten 
nanq^ams .of tpt^i^^ of each ^samp!| weri; raiio - 
examined for pectin. The i^lj-s.jtfgiiiiri ^ sr^pwfed 
tbfct very slrhUsr arnpunts oT geinomic DNAxVyere 
pres?ot; ^he m^xirnvini mean . ^-actln CT yuJue 
4if<erence ; ,wa^.l ^ ftgure 3 snoyys; Mie ra(le of 
B : a^Tnf <V cnange; D^fween the, 100- and 10-^ig 
sampres. y/as 5bpait$r (slope Valuer . i^n^-.be^een 
- 3^6 • anit - 3.45); VF tils . y^^,agaih ; that -the 
.method of ;$'afmpi.e pY^paraUon^eids "^i^ples pf 
■ idfinli&t ^^irftej^iiy (ie v no s^rn^ cptttained 
An excessive Sn>bunt PCR'irthibLtqif^ How- 
ever; thes* Tefctifos indicate tftat^eji.^ 
st^e^^^>^iff^^f:^- in tf)e.act:uat ^ainpur>t.j6:f 

genomk :1 DNA;<pQt^^^ -tife* ac:cpm^r?ed . 



o 




S^*^„i h * ^ Gr'Mu« obtained 




com?). Eaeh samte jwas ,anaiv=r«j- t* Jfeii^ 



m triplicate 




« tS 
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(delerni^bj, ultraviolet sp&ttbs«>py> 
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tae 293 cells: 24 hF a A a* fro«cfi,w^: 1 jV^ --^v 
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Tttti results m -529,pjasmid copl&fijper eellV; 

ha;;* bribed a new method for <t«aniitat. 

^•^ ^Hneahpn?. R^-Hme PCR-Ss compat- 
ible, with fei0ier t3f;tt)e. two l^R (Rt^GRJ^p- 
•TCpa'ches: 0> quantitative. c6ttipef#ve ^hele an 
«tteri$ Competitor for each tajgei seqiiejcipe « 
Vied fer.nortna^izatipn <data Inot.shown) Or ti) 

m ox a •'hQtiseke.cping;; gene M RT-PCR. if 
equal amounts of nucleic ad* are ajlaWl. for 
eaeh sample and if the ampJiflcation efficiency 
W^e..q^maM S »e. ? n^sjs is. identical fofeach 
sample, the Internal control (normalization gene 
or competitor) should giVe equal signals for all 
samples.- ' ' . 

The : ttalrtimePCR method *ffeb several ad- 

e/npl<^ (see^e |af reaction). Fjtf^the reai- 
nme PCR^nejhod fs ^ ^rfd>med1h ^sed-tSe 
jgffijm afid.requtr^ nO-p^r-PG-R p^r|pg«ion 
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ABSTRACT Wilt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISP-3, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (i) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-1 under the control of a 
tetracyline repressible promoter, and (ii) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24.3. WISP-1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISPS 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to >30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine-rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsn) to the cell membrane (1, 2, 6). Dsh then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3/3 (GSK-3/3) resulting in an increase in 
j3-catenin levels. Stabilized /3-catenin interacts with the tran- 
scription factor TCF/Lef 1, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
0-catenin levels (9). APC is phosphorylated by GSK-3/3, binds 
to /3-catenin, and facilitates its degradation. Mutations in 
either APC or 0-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and retractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-1 
and WISP-2, and a third related gene, WISP-3. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor; CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778, AF100779, AF100780, and AF100781). 
tTo whom reprint requests should be addressed, e-mail: diane@gene. 
com. 
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cDNA was synthesized from 2 /u,g of poly(A) + RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 /ig 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WISP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISPS were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 i*M of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40). 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centnfugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2 (Act) where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The W75P-specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-Elmer Applied Biosystems. 

RESULTS 

Isolation of WISP-1 and WISP-2 by SSH. To identify Wnt- 
1-inducible genes, we used the technique of SSH using the 



mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP-1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-1 gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of «*40,000 (M r 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-linked glycosylation sites 
and are 84% identical (Fig. 24). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of ~27,000 (Af r 27 K) (Fig. IB). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-linked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-1 (A) and 
WISP-2 (B) expression in C57MG, C57MG/WnM, and C57MG/ 
Wnt-4 cells. Poly(A) + RNA (2 jig) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP-1- specific probe 
(amino acids 278-300) or a 190-bp WISP-2-spccihc probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human /3-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WISP-1 (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1. 

Identification of WISPS. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3. A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP-2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3/1). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-1 , WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma cells, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov. CTGF is a chemotactic and mitogen ic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 35) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin-like growth factor (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerization and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISP-3 was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISP-3 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 




Fig. 4. (A, C, E, and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
D. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power (A and B), 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and F), expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing V/ISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (27, 29). 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fold) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5 . Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 jxg) 
digested with EcoRl (WISP-1) or Xbal (c-myc) were hybridized with 
a 100-bp human WISP-1 probe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc genes are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1, WISPS RNA was overexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISP-3 ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR-based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
Wnt-1. 

Three of the genes isolated, WISP-1, WISP-2, and WISP-3, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., j3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through j3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyril, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-/3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v j3 3 serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-/31, which is the stimulus for 
stromal proliferation (34). TGF-/31 is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This rinding suggests that paracrine signaling 
could occur in which the stromal cells could supply WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast rumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP-2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and /3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic 0-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. 
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Baker for technical assistance, P. Dowd for radiation hybrid mapping, 
K. Willert and R. Nusse for the tet-repressible C57MG/Wnt-1 cells, V. 
Dixit for discussions, and D. Wood and A Bruce for artwork. 
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method* Peptides AENKor AEQK were dissolved in water* mode isotonic with 
NaQ and diluted into RPM1 growth medium. T-cdl-proliferadon assays were' 
done essentially as described 21111 . Briefly, after antigen pulsing (30y,gmT l . 
TTCF} with tetrapeptiaes (l-2mgmr'), PBMCs or EBV-B ceDs -were 
washed in PBS .and fixed for 45 s in 0.05% glutaraldehyde. Glycine was added 
to a final concentration of OJM and the cells were washed five times in RPMI . 
1640 medium containing 1% FCS before co-culture with TV cell clones in 
round-bottom 96-well microti tre plates. After 48 h, the cultures were pulsed 
with I p,Cs of ^-thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 ngTTCF with 0.25 |ig 
pig kidney legumain in 500 pi 50 mM citrate buffer, pH 55, for i h at 37 °C 
GlycopeptWo digestions. The peptides HTONEEDI, HmN(N-glucosanune) 
EED1 and. HlDNESDI, which are based on the TTCF sequence, and.. 
QQQHlJFGShmT)C5GNFCLFR(K3dc),,which is.based on human transferrin t 
were obtained by custom synthesis. The threc C-terrninal lysine residues were - 
.added to the natural sequence to aid solubility. The transferrin glycopeptide. 
QQQHLFGSNVTDCSGNFGLFR was prepared by cryptic ( Promega) digestion, 
of 5 mg reduced, carbory-rnethylated human transferrin followed' by 
concanavalin A chroma'tograpEy ll ..Xjlycopeptides corresponding to residues 
622-642 and 421^452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N- terminal sequencing. The lyophilized transferrin- 
derived peptides were redissorved in 50 mM sodium acetate, pH 55, 10 mM 
dithiothreitol, 20% methanoL Digestions were performed for 3 h at 30 °C with 
5-50 mUmF 1 " pig kidney legumain or B-cefl AEP. Products were analysed by 
HPLC or MALDI-TQF mass* spectrometry using a matrix of lO.mgrar 1 a- 
cvanocinnamic add in 50% acctonitrilc/O.] % TEA and a PtrSeptive Btosystems 
Elite STR mass spectrometer set to linear or. reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568.13 mass- units. **• 

Received 19 September, accepted 3 November 1998. ; 



I. Chen, J. M. tt at Cloning, isolation, and characterisation of ma 
endopeptWaie./. BtaL Qtatt 272, 8090^8098 (1997). 



laBan kjuj7uin, ^n asparzginri 



t KonHiJ«,^A^Bo^^DJ. 1 Ki^t.aa5(B»rtctt,A.t f rhetwo( 

jcatepori&culontjvi durioeraation by use ofrpecifie Quorometne assay*.//!*. Biodiem.Bv>phy± . 
30J.20B-213O99S). _ 

* DaJtan, J. R, Hola lamraka, L & BrkflejvP. }. Aspiragfoyi crxfopeptidase tairivr in adull Schistosoma 

nrnfonLfyxajbokgy 111*5757-58011995); -;. *} v 

A. Bennett. K. tt at Antigen processing .for prcscmation" bf dais tt major hinocompitibUtty compter . 
require dcanp by calhcsptn E Ettr.J. tmmunot 22*15 19- 1524 (1992). . . 
• 5. . Rioc R. t rtoi Essemni role for caUitpsia S in MHC daw It-suocUxed Invariant chain processing 
' aaad peptide loading- Snunumty-i, .357^366(1996). 
6. Rodriguez, CLM& Dimcnt, S. Role of cathepsin D in arnigen procntauon of oyaJbmnio. A ImmuuoL 
. MS\2B94-2»98(1992). -w 
' 7. Hewitt, E- W. <f at. Nitvj^J piotasin^ ittci f<x bunun cathtpiin £ and atr*piin D in cctumi toxin: 
wnpikatiom for Tjcefl^tope generation: U Immunol 159, 4693-4699 (1997). 
ft. WitH. C C^arc a^J^trooiartof erogrrxxu intigcm for presrattdonon MHC mobcuk*, Anna. 
Rrr. hnmm&lS* S2i~B504997). 

10. fiocscr*^ MiUcr, J. EoAwonui proteases and wrngta prooiung. Trtnds BiochonSti 22, 377-352 

£ " : • . ' 

XL lai»>& Yin Halbcck. H. Cornpkte 1 H and -"C ce*c«wncc assignments of » 71 -amino add ghropeptfde 
^prepajedrVcrnhuirianser^ (1996). 

12. ftaron, D. 11 &.Lodcsky, R.M. The instructive rok of Innate imnntrury in the acquired Iromone 
raporae. Soma 272, 50-54 (1W6).* 

13. Medzhitow. R. ft ftneway, t. A_ JL Innate icnmuniiyt'rbe virtues of a nondonal ayrtem of rceognirJon. 
OH 91, 295-293 (1997). 

M* R. tt d. the tntigenknmcrure of the HfY gpl20 envdopc ^ycoprntein. Katun 393. J05-7I I 

15: BoardR P. rt ol N-grreoiriitbn of HIV. gp 120 may-conurain recognition by T lymphocyte*. /. 
■ hammol 147, 3128-3132 (1991 ). 

16. tJavldscm, H. Wt«. M: A. fit Wattx. C EndocytoiU, in tnceifolar irainddng. and processing of 
membrane IjG and monovalent antigen/membrane IgG complaca in B rymphocyU*. /. Immunol 
AH, 4IOi^t<fc 0990). 

17. BafretUA. j.& KlrsdikcKi Catfccpsin B, aihrpdn H and athcpUn ll MdhodsEnzymoL 60,535-559 
0981). ' 

ia-'Mifa>tT, A- U BriUntine, S. P., Small wood. A-E. 5r Fafrweather, N. F. Expreoion of uunitf toxin 
ftapneot C in-H. co/c iu purification and potential use at a vaccine. Bhuxhtob& i', 1043-1046 
(1989). 

19. tine, & R Be rUrtow, L Antibodies: A Laboratory Main*}. (Cold Spring Harbor laboratory Press, 
19W), 

TO. Uroavccdib. A. Anrigtn-apcdfic interaction between TindBcdk Nojurt 314, 537-539 (1985). 
2f, Pbni Lft W«tu,C Cbarkacrtiation of transport of newt/ assembled, T ccU:*rimulaiory MHCdw 
. n -peptide complexes from MHC das* II compartments to the cdj'sor&ce,/. Immunol 159, 543-553 
(1997). 

AcknowWgeo3enU.Wc thank M» Ferguson for helpful discussions and advice E. Smyth e and L. Grayson 
for advid: and technical ainitance;'a. Spruce, A, Kmght and the BTS (NlnewdU Hdspfnl) for hdp wWi 
bbod monocyte prqjaradon; and our colleagues fir many befpfoJ comments on the mantueript. This 
wbtk wa supported by the Wdlcoroe Trust, and by tn EMBO Long-term fellowship to B.M. 

Correspondence and requests for materials should be addressed to CW. (e-mail: cwttt^uitdccxudt). 




Genomic amplification of a 
decoy receptor for Fas ligand 
in lung and colon cancer 

Robert MJPittH, Scot Marstqrsn, David A. Lawrence^t, 
Margaret Roy% Frank C. Kischkel*, Patrick Dowd*, 
Arthur Huang", Christopher J. Donahues 
Steven W. Sherwood*, DaryK T, Ba!dwin\ Paul J. Godowski% 
William r. Wood% Austin L. Gurney% Kenneth J. HIUan% 
Robert L, Cohen*, Audrey D. GoddardVDavid Botste!n^ 
. & AvI AshkenazI* 

'Departments of Molecular Oncology, MoleaikrSwhgy, and Immunology/ 
Genentech Inc, I DNA Wa# South San Francisco, California 94060, .USA 
t Department of Genetics, StanfordVmvmity, Stanford, California 94305, USA 
1 These authors contributed equally to this work 

Fas ligand (Fast) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. 1). 
One important role of FasL and Fas is to mediate immune- 
cytotoxic killing of cells that are potentially- harmful to the 
organism, such, as virus-infected or tumour cells 1 . Here .we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), mat binds to FasL arid inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus; certain tumours 
may escape FasL-dependent unmune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. . ' : . 

By searching expressed sequence tag (EST) databases, we identi- 
fied a. set of related fiSTs that showed homology to the tumour 
necrosis factor (TNF). receptor (TNFR) gene superfemily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 

. protein encoded by this cDNA decoy receptor 3 (PcR3 j. The cDNA 
encodes a 300-anuno-add polypeptide that resembles members of 
the TNFR family (Fig, la): me ammo- terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich d omains 
(CRDs).jLike oneother TNFR homologue, osteoprotegerin (OPG) 5 , . 
pcR3 lacks an - apparent transmembrane sequence, which indicates 
that it may be a secreted; rather than a . membrane-asscodated, 
molecule. We expressed- a recombinant, histidine-tagged form of 
£)cR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamjde gels as a protein of- 
relative molecular mass- 35,000 (data not shown). DcR3 shares 

. sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%).. All of 
the cysteines in the four CRDs"of DcR3 and OPG are conserved; 
however, the carbdxy-terminal portion of.D'cR3'is 101 residues 

. shorter. ' / , . 

We* analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. Ib). VSTe detected a predbnunant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and lung: In addition, we observed relatively, high DcR3 raRNA 
expression in the human colon carcinoma cell line SW480. 

. To investigate potential Hgand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
0cR3-Fc to human 293 cells transfected. with mdividual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL* (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo217TRAIL*- 7 , Apo3LnWEAK w ,. or OPGI/TRANCE7 
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BAJSTKL 10 " 12 {data not shown).' DcR3-Fciminunoprecipitated'5hed 
lasL from RsL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Figv 2c) t as did the Fc-tagged ectodomain of Fas but not 
1NFR1. :GeI-fiItration chromatography showed that DcR3-Fc and 
soluble FasL fonned a . stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL .with a. comparable - affinity {K d = 0.8 i 0*2 and 
1.1 ± O.lnM, respectively; Fig. 2e)» and that DcR3-Fc could 
Wbck nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, , 
inset) . Thus, DcR3 competes with Fas for binding to FasL 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested, the effect of DcR3-Fc on apoptosls induction by soluble ' 
FasL in Jurkat T leukaemia cells, which express Fas.(Fig. 3a). DcR3- 
Fc and Fas-Fc . blocked soluble-FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 ftgmT*. Time-course analysis showed that the inhibition did 
'not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3^Fc on activation- 
induced cell death (AICD) of mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results", activation 
of mterleukin-2-stimulate4 CD4-positive T cells with anti-CD3 
antibody increased, the level -of apoptosis twofold, and Fas-Fc 
Mocked this effect substantially (Kg. 3c); DcRMFc blocked the 
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induction of apoptosis to a similar extent Thus* DcR3 binding • 
blocks apoptosis induction. by FasL 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves- perforin and 
grarizymesw^ 16 . peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Hg. 3d)r PcR3-F c< 
and Fas-Fc each reduced killing of target cells from —65% to 
—30%, with half-maximal inhibition at ~ 1 itgmT 1 ; the residual 
killing was probably mediated by &e perforuVgraiayme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and Fas-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL. activity, which is consistent with the greater 
potency of membrane-associated FasL compared with* soluble 
FasL 17 . • : ; v - x ! ■ •. ■ ■ • . 

Given the role of immune -cytotoxic cells in. eliminatibn of 
tumour cells and the feet that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3..expressibn might, contribute to the 
ability of some tumours to escape immune-cytotogdc attack. As 
genomic amplification frequently contributes to tumorigenesiSj we 
investigated whether , the DcR3 gene is amplified in cancer*. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 



figure 1 Primary structure and expression pf human Dcfl3. a; Alignment of the 
amlnc-add sequences of DcR3 and of osteoproteaerin(OPG): the Oterminal 101 
residues of OPG ere not shown. The put&tlve signal cleavage site (arrow), the 
<^e1ne-richdom3fns(CRD 1*4] t andtheA/-BnkedgfycosyiatJon site (asterisk} ere 
shown, b, Expression of DcR3 mRNA. Northern hybridization analysts was done . 
using the DcR3.cDNA as a probe end blots of poMAf UNA (Clontech) from 
human- fetal end adult tissues or cancer cell lines. PBU peripheral blood 
lymphocyte. 
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Figure 2 Interaction of DcR3 vyith FasL a* 293 cells were trensfected with pRI® 
vector (top) or wfth'pRKS encoding Wength- FasL (bottom), incubated with' 
DcR3-Fc (solid line, shaded ' area),' TNFR1 -Fc (dotted Kne) or buffer control 
(dashed line) (the dashed end -dotted tines overlap), and.analysed for tshding by: 
FACSi Statisticel analysis showed a s^lgnrftcant difference (P < Oj001) between the' 
binding of DcfWrFc to. cells trensfected with Fast, or pRK6'. PE phycoerythrfrv 
labelled ceils, b, 293 celte were transfected as-lna and metabollcaily labelled, and 
cell supematants were immunopreclpitated with Flagged TNFTU DcR3 or Faa, 
c Purffted soluble FasL (sF8sL)was immunopreclpitated with TNFR1-Fc, DcR3- 
Fc or Fas-Fc and visualized by Immunoblot wtth aritl-FasL antibody. sFasL was. 
loaded directly for comparison in the right-hand lane, d, Rag-tagged sFasL was 
incubated whh 0cR3-Fc or with buffer end* resolved by gel filtration: column 
fractions were enarysed h an assay that detects complexes corrta(nlr^'DcR3-Fc 
end sFast-Fag." e. Equilibrium binding of OcRfr-Fc or F«3-Fc.to s^asl-Rag; 
Inset, competition of DcR3-Fc whh Fas-Fc for binding to sFasL-rFlag: 
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reaction (PGR)" in genomic E>NA from 35 primary lung and colon 
tumours, relative to pooled, genomic DNA from peripheral blood, 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and. 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18-fold (Fig. 4a, b)l To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
. of DcR3-based PGR primers. and probes; we* observed nearly the 
same amplification (data not shown). '.' 

"We men analysed DcR3 mRNA expression in primary tumour 
tissue sections by in dtu hybridization; We detected DcR3 expres- 
sion in 6 out of 15 lungtumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and 1 out of I gastric tumour, (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c DcR3. mRNA was localized to infiltraung malignant epithe- 
Hum, but was essentially absent from adjacent stroma,, indicating 
tumour-specific expression* Although the Individual rumour sped- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DeR3 gehe is amplified frequently in tumours. 
STW480' colon carcinoma cells,; which showed abundant DcR3 
mRNA expression (Fig, lb), also. had marked DcR3 gene amplifica- 
tion, as shown by quantitative PGR (fourfold) and by Southern blot, 
hybridization (fivefold) (data not snown). '• 
■ . If DcR5 amplification' in cancer is functionally relevant, then 
DcR3- should be- amplified more than neighbouring genomic 
regions that are not important for tumour survival To test mis, 
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• figure 3 Inhibition of FasL activity by DcR3. a,,Human JurkarT leukaemia cells 
•.were Incubated with Rsg-tagged soluble FasL (sFasl; 5ngmr ! ) btlgomerfeed' 

• with entmag antitxjdy (0.1 jigmn 1 ) h> the presence of the proposed Inhibitors 
' DcR3-Fc. Fes-Fc or human IgGl andassayed for apoptosls (mean * s.e,m. of 
triplicates), b. Junta cells were Jncubated wiih sFasL-Flag plus antw=tog antibody 
as In a, In presence of 1 |tg.mr' OcR3-Fc (fined circles}, Fas-Fc (open circles) or 
human IgQl (triangles), and.epoptosls was determined at the indicated time 
points, c Peripheral blood-TceOs were stimulated wltrj PHA and interleukln-2. . 
followed- by control (white bars) or entKm antibody, (filled bBrs). together with 
phosphate-buffered saline (PBS),- human IgGl. Fas-Fc, or 0cR3-Fc (10 ^g mr 1 ): 
Afteue h, apoptosls of rfrJr cells was deterrrtned (mean ± s.e.m. of results from 
five donors), d; Peripheral Wood natural killer ceHs were Incubated with *Cr- 

• labelled Jurket cells in the presence of DcR3-Fc (filled circles). Fas-Fc (open 
circles) or humen • IgGl (triartgjesj. arid target-cell death was determined by 
release of B, Cr (mean * ad. for two donors, each in triplicate). 



we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR5 showed linkage to marker AFM218xe7 (Tl60),whichmaps to 
chromosome.position 20ql3. Next, we isolated from a bacterial 
amfiaal chromosome (BAQ library a human genomic done that 
carries DcR3, and sequenced the ends of the clone's insert We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and .of 
seven genomic, markers that span chromosome 20 (Kg: 4d). The 
DcR3^1inked reverse marker showed an average amplification of 
roughly threefold,, slightly less than the approximatery fourfold 
amplification of DcR3; the other markers, shbwed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre' of a distal chromosome 20 regidn that is amplified in colon' 
cancer, consistent with, the possibility thaxDcR3 ^amplification 
promotes tumour survival. :•.* 

Our results show that DcR3*inds specificaliy to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNT- 
ligand-fomily members; however, this does not rule out the possi- 
bility that DcR5 interacts with other ligands, as do some other 
TRFR family members, including OPG* w . 

FasL is important in regulating the immune response; however 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFttP, which modulates apoptosis signal- 
Img downstream of-Fas 20 . A second mechanism involves proteolytic 
shedding of FasLfromme cellsurrace 17 . DcR3 competes with Fas for 



« h ~ d 




Rgure 4GenomiC amplification of DcR3 In tumours, a, lung cancers, comprising ■ 
eight adenocarcinomas (c. d, f. g. h. J, k, r),seven squemous-ceJI carcinomas (a. e. 
m. f\. o. p. qj, one non-smaD-cell carcinoma (b). one smaftcell carcinoma % and 
one bronchial a darrc carcinoma (l);.The data ere means ± 8.d. of 2 experiments 
done In duplicate, b. Coton tumours, comprising 17 adenocarcinomas. Data are 
means.± s.e.m. of five- experiments done In duplicate: e, In situ hybiidbatfon 
analysis of DcR3 mRNA expression in a squamous-cen carcinoma of the lung. A 
representatfve brJght^field Image (left} and the corresponding, dark-fteid'.lmage 

• (rtght) show 0cR3 mRNA oyer Inffltrkting maligrfam- epithelium (arrowheads). 
Adjacent norvmaflgnant stroma (S), blood vessel(V) and necrotJp.tumour tissue 
(N) arealso shown, d; Average amplfHcation of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and' forward, Rev and J=wd), the 
DcR3-linked marker Tieo, andother chromosome-20 markers, In the nine colon 
tumours showing DcR3 amptiheetfon of twofold or more (b). Data are ftom W 
experiments done in duplicate/Asterisk Indicates P<om for a Students Nest 

. comparing each marker with DcR3. * 
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hsL binding; hence; it may represent a third mechanism, of 
ettraceflular regulation of FasL activity. A decoy receptor, that 
modulates the. function of the- cytokine interleukin-1 has -been 
4escnT>ed?V In addition, two decoy receptors that belong to the- 
Tr^ ramily, Dcftl and DcR2, regulate the FasL,related apoptbsis- 
inducing molecule ApoU 22 . Unlike DcRl and Dcitt, which are 

• membrane-associated proteini DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR-ramtiy member is 
OPG 5 ; which shares, greater sequence homology with DeR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a. third 

: cecoy.for Apo2L ! *. Thusj DcR3 and OPG define a. new-subset of 

• TOTR-faroily members that function as secreted decoys- to mod- 
ulate hgands that induce, apoptosis. Pox viruses produce soluble 
TNFR- homologies that neutralize/ specific TN^rarnfly- Ugands, 
thereby, modulating the antiviral immune response 2 . Our results. 

" indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours.. □« 

Methods ' " 

' feolatfoh of Dcft3 C0NA. Several overlapping ESTi in Genfiank (accession 
numbers AA025672, AA025673 m& W67560) and in lifeseq™ (Incyte 
PhaimaceuBcals; accession numbers 1339238, 1533571,1533650, 1542861, 

.. 17.89372 and 2207027) showed similarity to members of the TNPR ramify. We 

' screened human cDNA libraries; by PGR with primers based on the region of 
EST consensus; fetal hing was positive for a product of the. expected size. By 
hybridkadon-taa PCR-generated probe based -on the ESTs, one positive done 
(DttA30942]T was, identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, wc isolated 50" 
more clones; the coding regions of these clones were identical in size to.that of" 

" the initial done (data not shown). ' . " jr ^ \ '■■ • 

Fofuslon proteins (Immunoadheslns), The entire DcF3 sequence, or the 

- ectodomain of Fas or TNFRl, was. fused to the Hngeand Fc region ofhurnan 
lgGl, expressed m insect-SF9 cells or V human '2^*. cells, and purified. as 

'described". / ... !»;.:■"• . 

Ruoreacenee-activaM cell sorting (FACS) anafysls. We transfeeted 293 
cells using calcium phosphate or Effectene (Qiagen) with pRKS vector or pRK5 
encoding fiilHength human Fast' (2 fig), together with pRK5 encoding CrmA 
(2pig)- to- prevent cell death. After I6h, the ceQs were incubated with 
biotmylated DcR3 T Fc or pfFRl-Fc and then with phycoerythrm-conjugated 
streptavidxn (GibwBRLftand were assayed by FACS. The data were analysed by 
Xolrnogorov^rnirnoy statistical analysis. There was some detectable staining 
of vector-transfe^ted crib by DcR3-Fq as these cells express little FasL (data 

• not shown),, it' is possible that DcR3 recognized some other fector that, is. 
■ expressed cdnstitutively on 293" cells. 

lmmunoproc!pHatioR. Human 293 cells were transfeeted as above,. and- 
mctaboucafiy labelled with ( 3S S]cysteine and [*S] methionine (OS mG; 
Amersham). After 16h.of culture in the presence of z-VAD-rmk ( 10 \iM), 
tie medium was tmmunoprecipitated with DcR3-Fc, Fas-Fc or TNFRl -Fc 
(5 fig), followed br protein A-Sepharose (Replfgen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified, Hag-tagged soluble FasL (I n#) (Alexis), was incubated 
with each Fc-fiision protein (1 u.g)i precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anriY 
FasL antibody (Oncogene Research).. 

Analysis of complex formation. Flag-tagged soluble FasL {25 >ig) was 
mcnbatedwthbun%rorvflmDdl3~Fc(40u.g^ for 13h at 24 °C The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS* 0.6-ml tractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysedby placing 100 uJ aliquots into microtitre 
wells prccoated with anti-human IgG (Boehringer); to capture DcR3-Fe, 
foDowed by detectionwith baotinylated anti-Flag anuT>qdy Bio M2 (Kodak) and 
streptavidm^horseraoish peroxidase (Amersham}. CaUbration of the column 
indicated an apparent rdative^molecular mass/of thecorajdex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimen 
to two soluble FasL homotrimers. 

Equilibrium binding analysts. Microtitre wells were coated .with anti-human 



IgG, blocked with 2% BSAin PBS: DcR3-Fcor Fas-Fc was added. foDowecl by 
serially dflnted Flag-tagged soluble FasL. Bound ligand was detected with arm- 
Hag antibody as above. In the competition assay, Fas-.Fc was immobilized as 
above, and the wells were blocked with excess IgG! before addition of Flag- 
tagged soluble FasL plus i>cR3-Fc . 

T^alJ AICD. CD3* lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Mfltenyi Biotech), 
stimulated with phytohaernagglirtinin (PHA; 2 ngmT^ for 24h, and cultured 
mmepresiinceofmterleukin-2 (ITOUmT^Kor 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed ferapoptosis 
16 b later by FACS analysis'of armexm-V-binding of CD4* cells" 
Natural killer call activity. Natural kOler cells were isolated from peripheral 
blond of individual donors using anti-QD56 magnetic beads (Mfltenyj 
.Biotech), and incubated for 16h with s, Cr-loaded Jurkat cells at an efifector- 
to-target ratio of 1:1. in. the presence of DcR3-Fc, Eas^Fc or. human IgGl; 
Target-cell death was oeteniuned by release qf . s, Cr in effector-target co- 
cultures relatiye.tojelease of SI Cr by detergent lysis of equal numbers of Furkat 
cells. . 

Gone-amplhicaUon analysis. Surreal specimens were provided by J; Kern 
(lung tumours) and P. Quirice (colon tumours). Genomic DNA was extracted 
(Qiagen) and the. concentration, was determined us^ng Hoechst dye 33258 
intercalation fluorometry. Amplification was detennined by quantitauvePCR** 
usmg a TaqManmstrument (ABI). The method was validated by comparison of 
POt and Southern hybridization data for the Myc and HER-2 oncogenes (data 
not shown). Gene-specific primers and. fluorogenic probes were designee! on. 
the basis, of the sequence of DcR3 or of nearby regions identified on. a BAG 
carrying the human DcR3 gene; alternatively, primers and probes were- based 
oh Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159); the nearest 
avaSable marker which maps to -500 kilobases from T160, and five extra 
markers that span chromosome 20. The bcR3-spedfic primer sequences were 
5'-CrrCITCGCGCACGCnG,3' and 5' -ATGACGCGGGCACCAG-3'- and the 
fluorogemc probe sequence was 5'-(FAM-ACACGATGCGTGCTCCAAGCAG 
AAp- CTAMARA), where ?AM, is 5 r -fluorescein plmsphoramidite. Relative 
gene-copy numbers were derived using the formula 2 (ACn ; where ACT is the 
difference in amplification cycles required* to. detect DcR3 in pcripheralllood 
lymphocyte DNA compared to test DMA. 
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ABC transporters (also known as traffic ATPascs) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across, membranes of both prokaryotes and 
eukaryotes 1 . The recently completed Escherichia coB genome 
sequence revealed that the largest family of paralogous^fi'^Zi 
proteins is composed of ABC transporters 2 . Many ^ultaryotic 
proteins of medical significance belong to thisyramiiy/ such as 
the cystic fibrosis transmembrane conductanceiegulator (CFTR), 
the rVg^proteih for multidiiig-resisW^ and the 

hetcrodimeric transporter, assodated^whfi. antigen processing 
(Tapl-tap2); H ere we report the crj^structure at IS A resolu- 
tion of HisP, the ATP-binding subunit of the histidihe permease, 
which is an ABC transporter fttm>$abnotteBa typhimuriurru We 
correlate the details of this structure with the biochemical; genetic 
and Diojmysical properties -of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFFR proteins. 
' - ABC transporters contain four structural domains: two nudeo- 
We^mome^niomflins (NBDs), which are highly conserved 
toougtfout the fomily; and two transmembrane " domains 1 . In 
■^okaryotes these domains are often separate subunits. which are 
assembled into a membrane-bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain* The 
periplasmk histidincpermease ofS. typhimurium BtidRcoW^ is a- 
weD-characterized ABC transporter that is a good model for. this 
supexnunily. It consists of a membrane-bound complex, HisQMP 2 » 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP,. the ATP-binding subunit HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins', is accessible froinboth sides.of the membrane, 
presumably by its interaction with HisQ and HisM 6 ; The two HisF 
subunits form a dimer,. as shown by their, cooperativity in . ATP 
hydrolysis 3 ; me requirement for both subunits to be present for 
activity 4 , and the formation of a HisF dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer 3 : HisP has been purified, 
and characterized in an active soluble form 5 which can be recon- 
stituted intp a fully active membrane-bound complex 1 ; 
- . The overall shape of the crystal structure of the HisP.monomer is 
that of an *L* with two thick arms (arm I and arm. II); the ATP- 
binding pocket is near the end of arm I (Fig. I). A six-stranded p- 
sheet (P3 and P8-0 12) spans both arms of the L, with a domain of a 
a- plus p-type structure (pi, 02, p4-P7, al and a2) on one side 
(within arm I) and a domain of mostly a-helices (o3-ra9) on the 




.flBure r Crystal structure of HlsR a,- View of the dimec. along en axis 
perpendicular to its tworfofd axis..The top and bottom of the dimer are suggested: 
to face towards the perlplasmic and cytoplasmic sides, respectively (see text). 
The thickness of arm Ills about 25 A,' comparable to that of membrane. a-Helices 
are shown In orange and a-sheets In green, b; View along the two-fold axis of the 
HlsPdlmer. showing the rotative displacement of the monomer* not apparent in 
a. The 0-strands atthe dimer interface are labelled, c; view of one monomer from 
the 'bottom of arm I. as shown in -a. towards arm II. showing, the ATP-blndfng 
. pocket a-c, The protein'enii the bound.ATP are in 'ribbon' and 'baD-entfistick 1 - 
•representations, respectively. Key residues discussed in the text arehdicated in 
a These, figures were prepared with MOLSCAIPT" N, amino terminus; C C 
terminus, " 
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• Gene amplification is a common event in the progression of ' 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative poiymerase-chain-reaction (PGR) method, 
based on fluorescent TaqMan methodology, and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
■ of measuring fluorescence Jn real-time, was used to quantify 

- gene amplification In tumor DNA. Reactions are character- 
ized by the point duringt:yc1ing when PCR amplification is still 

.in the exponential phase, rather than the amount of PCR. 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible In reac- 
tions starting with the same copy number. This greatly 
improves, the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product carry-over con- 
tamination;, it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The realtime PCR method, was used to develop and validate , 
a simple and rapid assay for the detection and quantification 
of the 3 most, frequently; amplified genes (myc, ccnd 7 and 

- eroB2) in breast tumors. Extra copies of myc.ccndl and erbB2 
were observed fn 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively, these results correlated 
well with those of Southern blotting. The use of this new 

. semi-automated technique will make molecular analysis of . 
human- cancers simpler ami more reliable; and should find 
broad applications in clinical and research settings, int. L 
Cancer 78:661-666, 1998. 
&J998 miey-Liss; Inc. 

Gene amplification plays an important role in the pathogenesis 
: of Various solid tumors^ including breast cancer, probably, because 
over-expression of the amplified target genes confers a selective. 

- advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosornal double 'minutes 
(dmins). or as integrated homogeneously staining region* (HSRs), 

: are among the main visible* cytogenetic abnormalities in' breast 
tumors: Other techniques such as comparative genomic, hybridiza- 
tion (CGH) (Kallioniemi eval. 1994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identity 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include mvc (8q24), 
ccndX (H.ql3), and eroB2 (1 7ql2-q21) (for review, sec Bieche and 
Lidereau, 1995): 

Amplification of the myc, ccnd I, and erbBl proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identity 
sub-populations with a worse prognosis (Berns et al,. 1992;. 
Schuuring et al, 1992; Sfamon et all, 1987). Muss ei al (1994) 
suggested .that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
anp* hormone therapy. ■ 
• However, published results diverge both in terms of the fre- 
quency of these alterations- and their clinical value. For instance, 
over 50Q studies in 10 year* have failed to resolve the controversy. 



surrounding the .link suggested by Slamon et al (1987) between 
erb&l amplification and disease progression. These discrepancies • 
are partly due to the clinical, histological and ethniabeterogeneity 
of* breast cancer," but technical considerations are also probably 
involved. . * 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot, blotting. 
These batch., techniques require large amounts of DNA (S-riQ 
ug/reaction) to yield' reliable quantitative results. Furthermore, 
meticulous care, is required at all stages of the procedures tq= 
generate blots of sufficient .quality for reliable dosage analysis! 
Recently, PCR has proven to be a powerful' tool for quantitative 
DNAanalysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors- and formal in-fixed, paraffin- 
embedded tissues): 

Quantitative PCR can be performed by .evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a. varying number of cycles during the 
. exponential phase (kinetic quantitative PCR> ; In the first case, an 
. internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. ; 

. One of the majbradvahtages of the kinetic method is its rapidity; 
in quantifying a new gene, since no internal standard is required (an 
external standard curve is sufficient): Moreover, the kinetic method 
.has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay- mix at. 
regular intervals and quantifying, for .each aliquot, the amplifica- 
tion product Interest in the kinetic method has been stimulated by a 
novel approach using.fluorescent TaqMan methodology and a new * 
• mstruinent (ABI Prisra 77O0 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et al, 1996; Heid et 
al, 1996). The TaqMan reaction is baseil on the 5' nuclease assay 
. first described, by Holland et al (1991): The latter, uses the 5' 
nuclease activity of Taq polymerase to.cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The. 
approach uses dual-labeled, fluorogenic hybridization probes (Lee 
et al, 1993). One. fluorescent dye, co-valently linked to the 5* end- 
of the oligonucleotide, serves as a.reporterlFAMfre., 6-carbbxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (Le., 6^»rboxy-tetramemyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5'-3' nucleolytic activity of BNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission,, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the. reporter dye (FAM) by the emission intensity of a 
reference dye (i.e., ROX, 6^rooxy-X-rhodamine) included in • 
TaqMan buffer, to obtain a ratio defined as the Kn (normalized 
reporter>for a given reaction tube. The use of a sequence detector 
enables, the -fluorescence , spectra of all 96- wells of the thermal 
cycler to be measured continuously during PGR amplification; 
. The real-time PCR method offers several rad vantages over other 
current quantitative PCR methods. (Celr et aL, 1994); (i) the 
probe-based homogeneous. assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a- 
signal; fii) the Q (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product, accumulation. This is the main reason why Q is a more . 
reliable measure of the starting copy number than are end-point . 
measurements,^ which a slight difference in a limiting component, 
.can have a drastic effect on the amount of product; (Ui) use of Q 
'values gives a wider dynamic range (at least S orders, of magni- 
tude), reducing the need for serial dilution;'^ The real-time PCR 
method is run in a closed-tube* system and requires no post-PCR 
• sample handling,, thus avoiding potential contamination; (y) the • 
system is highly automated, since the instrument continuously' 
measures fluorescence in all 96 wells of the thermal cycler during 
-* PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
availablejust one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96* reactions can be 
. analyzed in 2 m\ 

Here, we applied this semi-automated procedure tp determine 
the copy numbers of the 3 most -frequently amplified genes in breast 
tumors (myc w ccndl and er&B2),.as well as 2 genes (alb and app), 
located in a chromosome region in which no. genetic changes have 
been observed in breast tumors. The results for 1 08. breast tumors 
were compared with previous Southern-blot data for the same 
samples. . 



MATERIAL AND METHODS' 
Tumor and blood samples. ' 

Samples were obtained from 108 primary breast tumors removed 
surgically from patients at the Centre Rene-Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy.. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular- weight DNA. Patients 
were included in. this study if the tumor sample used for DNA 

• preparation contained' more than 60%. of tumor cells (histological 

• analysis): A blood sample was also taken from 18 of the same 



DNA was. extracted from tumor tissue and blood leukocytes 
according.to standard methods. 

Real-timePCR 

Theoretical basis.. Reactions are characterized by the point 
.during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PGR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed The parameter C, (threshold cycle) . is 
defined as the- fractional cycle, number at which the fluorescence 
generated by cleavage of the probe: passes a fixed threshold above 
baseline. The target gene copy .number in unknown samples is 
. quantified by measuring Q and by using a standard curve to 
determine the siarting copy number. The" precise amount of 
genomic DNA (based on optical density) and its quality (Le. t lack 



of extensive degradation) are both difficult to assess. We therefore- 
abaquantified a control gene (alb) mapping to chromosome region 
4qM-ql3, m. which no genetic alterations have been. found irr 
breast-tumorDNA by means of CGR(Kallioniemi e/:a>;,.1994}. 

Thus, the ratio of the copy number of the target gene to the copy 
number "of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N", and is determined as follows: 

copy number of target gene (app. myc, ccndl, erbBl) 
copy number of reference gene (alb) 

Primers, probes, < reference human. genomic DNA and PCR. 
consumables. Primers and probes were chosen with the. assistance 
of the computer programs OUgo 4,0* (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer- 
Express (Perlrin-Ehner Applied BiosysteraSi Foster CityjCA).: 

Primers were purchased from DNAgency (Malvern; PA) and 
probes from Perkin-Elmer Applied Biosystems: 

Nucleotide seo^ences for the oligonucleotide hybridization 
probes and primers are available on request 

The TaqMan PCR Core* reagent kit, MicroAmp optical tubes,' 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems.. 

Standard-curve construction: The kinetic, method- requires a 
standard curve. The latteT was constructed' with serial dilutions of 
specific PCR: products, according,, to Piatak et aL .(199$). In 
practice, each specific PGR product was obtained by amplifying 20 
ng. of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same- primer pairs as those used later for 
real-time quantitative PGR. The 5 PCR products were purified 
using MicroSpin S-400 HR column's (Pharmacia, Uppsala, Swe- 
• den} electrophbrezed through an acrylamide gef and stained with 
ethidium .bromide to check ttieir quality. The PCR products were 
then quantified spectrbphotomctrically and- pooled, and serially 
diluted 10-fold in mouse genomic DNAJClontech, Palo Alto, CA> 
at a constant concentration of 2 ng/pL The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from lOr* (10* copies of each gene) to 
10" » (10* copies). This series of diluted PCR products Was 
aliquoted and stored at -80°C until use. ; 

The standard curve was validated- by analyzing. 2 known, 
quantities .of calibrator human genomic DNA (20 ng and 50ng): 

PCR amplification. Amplification mixes (50 pj> contained the 
sample DNA(aroundZ0 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul),200 uM aVffP,.dtTP; dGTP, and 400 
uM dTJTP, 5 mM MgCl* 1 .25 units of AmpUTaq Gold\ 0.5 units of 
AmpErase.uracil N-glycosylase (UNG),.2b0 nM .each primer and. 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 1 0 rain at 95°C. Thermal cycling consisted oi*40 cycles at 
95 Q C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 10 5 to 10 1 copies) in duplicate, ano-template control; 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate; and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-weIl microplate). All 
samples with a coefficient of variation (CV) higher than 1 0% were 
retested. . 

. All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fiuorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each.of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the da^a are 
analyzed automatically. The software accompanying the "7700 
system'calculates Q and determines the starting copy number in the 
, samples. : . 
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GENE AMPURCATIOM BY REAL-TIME PCR: 



'Determination* of gene amplification. Gene . amplification was 
calculated as. described above. Only samples with an N value 
higher than-2 were considered to be amplified . : 

RESULTS. 

. To validate the method; real-time PCR was . performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients? ' 
The target genes were the myc, ccndl and erb&l proto-oncogenes, 
and- the 0-amyloid precursor protein gene (app\ which maps to a 
chromosome region (21q2i\2)ih which no genetic alterations have 
been found in breast tumors (Kallioniemi et al, 1994). The 
reference, disomic gene was the albumin gene (alb, chromosome 
4qll-ql3): 
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Nidation of the standard curve and dynamic range 
of real-time PCR 

The standard curve was constructed from PCR produce serially- 
diluted in genomic mouse DNA. at a constant concentration of 
2-ng/ul. ft should be noted that the5 primer pairs chosertto analyze 
the S target genes, do not amplify genomic mouse DNA (data not* 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders, of 
magnitudeX with. samples containing as few as TO 2 conies* or as' 
many as 10* copies. 

Copy-number, ratio of the 2 reference : genes Yapp and sib) 

The app to alb copy-number ratio was determined in 18 normal 
leukocyte DNA -samples and all 108 primary breast-tumor DNA 
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sampler We selected these 2 genes because they are located b 2 . reagents and requires', relatively Iaree amounts of Weh-aualitv 

chT^^T/T ( ^;. 21 ^ ?* *l»HJ3> «t which no , genomic DNA-, which means itcan^beS^S to w 
obvious genetic changes (including earns or losses^ haw h**n uk nM>n »» a tins. * .. t" 1 . ? 111 



_ ■ '-o — - \-^rr» i—-— *» — ^*t« * • »tu 

. obvious genetic changes -(including gains or losses) have been 
observed in breast cancers (KaUionicmi et aL.. 1 994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 13 
(mean 1.02. ± 0.21), and was similar for the 108 primary breast- 
tumor DNA samples. (0.6 to 1.6,. mean r.06 ± 0.25), confirming 

, that alb and app are appropriate reference disomic genes for 
breast-tumor DNA. The low range of the ratios also cdntumed.that 
the nucleotide sequences chosen for the primers .and probes were 

• not polymorphic, as mismatches Of their primers or probes with the- 
: subject's DNA would have resuitediri differential amplification, 

myc, ccndl and trbS2gene dose in normal leukocyte DNA 

To determine' the cut-off point for gene ampIhTcation in breast- 
cancer tissue, . 18 normal leukocyte DNA samples were tested for 
the gene dose (N), .calculated as described in ^Material and 
Methods". The N value of these samples ranged rrora 0,5 to 13 
(mean 0.84 ± 0.22). for mvc, 0:7 .to U6 (mean I.06"± 03) for 
cc«rf/and0.6to L3(mean0.9I ± 6.19>f6rertB2.SinceN values 
> for mya ccndl and erflB2 in normal leukocyte DNA consistently 
fell between 0.5: and 1.6, values of 2 or more were considered to. 
represent gene amplification in tumor DNA : 

. myc, ccndl and erbitt gene dose in breast-tumor DNA ; 

• myc, ccndl and erbB2 gene copy numbers in the 108 primary 
breast tumors are reported in .Table I: Extra copies of ccndl were 

;. more. frequent (23%, 25/108) than extra copies of erbhl (15%, 
_ 167108) and myc (10%, 11/108), and ranged from 2 to 1.8.6 for 
ccndl, 2 to- 15.1 for erbB2, and only 2 to 4.6 for the -wye gene. 
Figure 2 and Table II represent tumors in which the ccndl gene was. 
amplified 16-fold (T145), 6-fold (T133)ahdnor*amphfied (Tn8). 
The 3 genes were never found to.be co-amplified in the same tumor. 
e/#B2 and ccndl were co-amplitled in only 3 cases, myc andccndl 
in 2 cases and myc ander6B2 in I case..This favors the hypothesis 

. that gene amplifications are independent events in. breast cancer, 
taterestingjy, 5 tumors showed a- decrease of at least 50% in the 
erb&l copy number (N < 0.5), suggesting thatthey bore deletions 
of the 17q21' region (the site of erbB2). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

. Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis . - 

■ \ SouthemTblot analysis of myc t ccndl and eroB2 amplifications 

. had previously been done on the same 1 08 primary breast tumors. A 
perfect correlation between the results of real-time- PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N a 5). However, there were cases (1 myc, 6 ccndl and 4 er6B2) 

. in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9).. * * : 

• *■ .. . ' • 

DISCUSSION 

The clinical applications of gene amplification assays are 
• currently limited, but would certainly increase if a simple, standard- 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values.' Southern blotting is also time-consuming, uses -radioactive 



TABLE l- DJSTTUBUTION OF AMPLIFICATION LEVELfN) FOR myc. 
catdt AND er&B2 GENES IN 108 HUMAN BREAST TUMORS 
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laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from, minimal 
quantities of tumor DNA. (small early^stage tumors, cytopuncture 
specimens or fcrrnaluvfixed; parafRh-embedded.tissues).- 

In this study, we valfdated a PCR method developed for the 
• quantification of gene over-representation in tumors. The method; 
based on real-time analysis of PCR amplification, has several' 
advantages 6ver other PCR-based quantitative ' assays such as 
competitive quantitative PCR (Celi et ai. 1994). First, me real-time 
PCR method is performed in a closed^tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented, by using the enzyme, uracil* N-glycc*ylase . (TJNG) 
(Longo et aL, 1990). The second advantage is-' the simpticity and 
rapidity of sample analysis, since, no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, m triplicate, the nurober of 
copies of a target gerie in more.&an.lOO tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, . 
meaning that samples do not have to contain equal starting amounts 
of DNA This technique should therefore besuitable for analyzing 
fornialin-fixed, paraffin-embedded tissues. Fourth, and' above all, 
real-time PCR makes UNA quantification much more precise, and . 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product Indeed; 
the ABI Prism 7700 Sequence Detection System enables Q to be 
calculated -when PCR amplification is still in the exponential phase 
and when none pf- the reaction components is rate-limiting. The 
withih-run CV of the C, value for calibrator human DNA (5- 
replicates) was aiways.below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
Ct ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve). are 
similar in our experiments (data not shown). Moreover, unlike 
•competitive quantitative PCR, .real-time PCR does not require an 
. internal control (the design and: storage of internal controls and the 
validation of their amplification efficiency is laborious). 
. _The only potential disavantage ofreaT-time PGR, like all other 
PCR-based methods- and solid-matrix blotting techniques tSouuV 
ern blots and dor blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of 9NA from tumor/ cells contained in 
heterogeneous tissue specimens. Only FISH and immunohtstochen> 
istry can measure alterations on a cell-by-cell basis (Pautetti et ai 
.1996; Slamon et aL, 1989)- However, FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does, not assess gene, expression and therefore * 
cannot detect cases in which the gene product is over-expressed* in 
the absence, of gene' amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immimohistochemistry is 
subject to considerable variations in the hands of different teams, . 
owing, to alterations of. target proteins during the procedure, the 
different primary antibodies and fixation methods used' and the 
criteria used to define positive staining: 

' The results of this study are in agreement with those reported in 
the literature, (i) Chromosome regions 4ql^q 13 and 21q21.2 
(which bear alb and.app.. respectively) showed no genetic alter* 
atibns in the breasfceancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et aL, 1994). (/Q We found that 
amplifications of these 3 oncogenes were independent events, as 
reportedhyomerteams(Berhsera/., 19%Borger.<tt;, 1992):.(/fi) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and eroB2 amplifica- 
tion, confirming the findings of Borg et aL (1992) and Courjal etaL 
(1997). (rv) The maxima of ccndl and erbB2 over-representation , 
were. 1 8-fold and 1 5-fold, also in keeping with eariienesults (about 
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experiment Triplicate plots were performed for each tumor samplci but the data for only One are shown here, the results are shown in Table!!. 



30-fold maximum).(Berns et aL,. 1992; Borg et aL, 1992; Courja! et 
aL, 1997). (v) The er£B2 copy numbers obtained with real-time 
PGR were in good agreement with data obtained with other 
quantitative PCR-based assays in terms of the frequency and 
degree.of amplification (An et aL, 1 995; Deng et ah t 1996; Valeron 



et aL, 1996). Our results also correlate well with those recently 
published by Gelmini et aL (1997>, who usedthe TaqMan system to 
measure erB&l amplification, in a small series of breast tumors- 
(n = 25), but wiraaniiistrument(LS-$OB luminescence spectrom- 
eter, Peridn-Elmer Applied Biosystems) which only, allows- end- 
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TABLE M - EXAMPLES OF ccndl GENE-DOSAGE RESULTS 
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' 'For each sample, 3 replicate experimeiits were performed and the mean 
and fcs standard deviation (SD) -was determined: The level of ccndl gene 
" amplification (Nccndl/alS)- is determined by drvfding the average ccndl 
. copy nurnber value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PGR- (yi) We found a high degree of concordance- between 
real-time quantitative PGR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
5-fold). The. slightly 'higher frequency of gene amplification 

• (especially ccndl and er£B2) observed by means of realrtime 

• quantitative PGR as compared with Southern-blot. analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rale out the possibility that some tumors with a few extra 



gene copies pbserved in real-time t>GR had additmnal copies of an 
arm ox a whole chromosome (trisomy, tetrasomy or polysorhy) - 
rather man- true .gene amplification. These 2 types of genetic 
alteration (polysemy and gene 'amplification) could be easily- 
distinguished in the future by using an additional probe located on • 
the same chromosome arm, but some distance from the target gene: ■ 
It is noteworthy that high gene copy numbers have the greatest-: 
prognostic, significance in breast carcinoma (B org ei a/.. 1992: 
Slamon <?r*/.. 1987).' 

Finally, this- technique can be applied to the detection of gene:, 
deletion as weft as gene amplification, indeed, we found a 
decreased copy, number of «roB2 (but not' of the other 2. proto- . 
oncogenes) in several tumors; eroB2 is. located in* a chromospme 
region (llqfcl) reported to contain both deletions and amplifies- •• 
n'ons in breast cancer (Bieche and Lidereau, 1995). 
. In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for .early diagnosis Of. cancer, 
. staging, prognostication and choic^of treatment Southern blotting 
is not sufficiently sensitive^ and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes bom these limitations, and 
is a sensitive and accurate, method of analyzings large numbers * of 
samples in- a short time. It should find a place in routine clinical 
gene dosage. 
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~10x). Also, some tumor lines have amplified ras 
genes. A 20-fold increase in the level of a nontrans- 
forming Ras protein is sufficient to allow the 
transformation of some cells. The effect has not 
been fully quanlilated, but it suggests the general 
conclusion that oncogenesis depends on over- 
activity of Ras protein, and is caused either by 
increasing the amount of protein or (probably more 
efficiently) by a variety of mutations that increase 
the activity of the protein. 

Transfection by DNA can be used to transform 
only certain cell types. Although transforming 
oncogenes have been isolated from both rodent and 



human cells, most targets for transformation by 
transfection with oncogenes have been rodent 
fibroblasts in culture. (In fact, the difference in the 
source of the oncogene [human] and the recipient 
cell [rodent] is an important factor in allowing 
the donor gene to be distinguished unequivocally 
from recipient DNA.) Limitations of the assay 
explain why relatively few oncogenes have 
been delected by transfection. This system has 
been most effective with ras genes, where there 
is extensive correlation between mutations that 
activate c-ras genes in transfection and the 
occurrence of tumors. 




A variety of genomic changes can activate proto- 
oncogenes, sometimes involving a change in the 
target gene itself, sometimes activating it without 
changing the protein product. In cases of insertion 
and translocation, there is evidence that the 
genomic change is the causative event; in cases of 
amplification there is a correlation with tumori- 
genesis, but no direct proof for a causative role. 

Many tumor cell lines have visible regions of 
chromosomal amplification, as shown by homo- 
geneously staining regions (see Figure 56.27) or 
double minute chromosomes (see Figure 56.28). In 
some cases, the amplified region contains a known 
oncogene or a gene related to one. In other cases, 
where amplification is not visible, the use of 
batteries of probes representing oncogenes shows 
that a particular oncogene is amplified. Examples 
of oncogenes that are amplified in various tumors 
include c-myc, c-abl y c-myb, c-erbB, and c-K-ras. 

Established cell lines are prone to amplify genes 
(it is one of several karyotypic changes to which 
they are susceptible). All the same, the presence of 
known oncogenes in the amplified regions, and the 
consistent amplification of particular oncogenes in 
many independent tumors of the same type, again 



strengthens the correlation between increased 
expression and tumor growth. Of course, it is 
possible that the gene amplification gives an 
advantage to growth of the established tumor; it is 
not necessarily ah event involved in its initiation. 

Some proto-oncogenes are activated by events 
that change their expression, but which leave their 
coding sequence unaltered. The best characterized 
is c-myc, whose expression is elevated by several 
mechanisms. One common mechanism is the 
insertion of a nondefective retrovirus in the vicinity 
of the gene. 

The ability of a retrovirus to transform without 
expressing a v-onc sequence was first noted during 
analysis of the bursal lymphomas caused by the 
transformation of B lymphocytes with avian virus. 
Similar events occur in the induction of T cell 
lymphomas by murine leukemia virus. In each case., 
the transforming potential of the retrovirus seems to 
lie with its LTR rather than with a coding sequence. 

In many independent tumors, the virus has 
integrated into the cellular genome within or close 
to the c-niyc gene. The gene consists of three 
exons; the first represents a long nontranslated 
leader, and the second two code for the c-Myc 
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f protein. Figure 59.8 summarizes the types of in- 

| serlion at this locus. The retrovirus may be inserted 

!;• at a variety oflocations relative to the c-myc gene. • 

The simplest insertions to explain are those that 

F occur within the first intron. The LTR provides a 

| promoter, and transcription reads through the two 

j coding exons. Transcription of c-myc under viral 

} control differs in two ways from its usual expres- 

I sion: the level of expression is increased (because 

j the LTR provides an efficient promoter); and the 

| transcript lacks its usual nonlranslaled leader 

I (which may usually limit expression). 

Activation of c-myc in the other two classes of 

I insertions reflects different mechanisms. The 

I retroviral genome may be inserted within or 

\ upstteam of the first intron, but in reverse orienta- 

\ tion, so that its promoter points in the wrong 

j direction. Probably the LTR provides an enhancer 

1 that acts on an upstream sequence that fortuitously 

\ resembles a promoter. The retroviral genome also 

j may be inserted downstream of the c-myc gene, in 

[ which case ^inscription initiates at the usual c-myc 

j promoter(s),butis increased by the enhancer in the 

I retroviral LTR. 

In all of these cases, th e coding sequen ce oj c-myc 
is unchanged, so oncogenicity is attributed to the loss 
of normal control, and increased expression, of th e 

I gene. 

I ; Other oncogenes that are activated in tumors by 

I I the insertion of a retroviral genome include c-erbB, 

f \ c-myb } c-mos, c-H-ras, and c-raf. Up to 10 other eel- 

| |: hilar genes (not previously identified as oncogenes 

| (r'by their presence in transforming viruses) are 

| I implicated as potential oncogenes by this criterion. 

| |The best characterized among this latter class are 

| \wntl and int2. The wntl gene codes for a protein 

| ^involved in early embryogenesis that is related to 

| f the wingless gene of Drosophila; int2 codes for a 

I f growth factor related to FGF. 

I Translocation to a new chromosomal location is 

| |anotherofthe mechanisms by which oncogenes are 

I lactivaled. Certain chromosomal translocations are 

1 ^consistently associated with activation of onco- 

j penes that lie near the breakpoints. This situation 

\ pvas originally discovered via a connection 

\ petween the loci coding immunoglobulins and the 



y/ Figure 39.8: - 1 
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occurrence of certain tumors. Specific chromo- 
somal translocations are often associated with 
plasmacytomas in the mouse and with Burkitt 
lymphomas in man. These tumors arise from 
undifferentiated B lymphocytes. The common 
feature in both species is that an oncogene on one 
chromosome is brought into the proximity of an Ig 
locus on another chromosome. Similar events 
occur in T lymphocytes to bring oncogenes into the 
proximity of a TcR locus. 

The basic cause of the translocation event is a 
malfunction of the system responsible for recom- 
bining V and C genes or switching between IgH 
C genes. Instead of acting on two sites within the Ig 
or TcR locus, the system recombines the immune 
locus with an unrelated region on a different 
chromosome. This results in a reciprocal 
translocation, which is illustrated in Figure 59.9. 

We do not know the basis for the involvement of 
the nonimmune partner, but in both man and 
mouse it is often the c-myc locus. In man, the 
translocations in B cell tumors usually involve chro- 
mosome 8, which carries c-myc, and chromosome 
14, which carries the IgH locus; *- 10% involve chro- 
mosome 8 and either chromosome 2 (k locus) 
or chromosome 22 (lambda locus). The trans- 
locations in T cell tumors often involve 
chromosome 8, and either chromosome 14 (which 
has the TcRa locus at the other end from the Ig locus) 
or chromosome 7 (which carries TcR /? locus). 
Analogous translocations occur in the mouse. 

Translocations at the IgH locus in B cells fall into 
two classes. One type is similar to those observed 
at other Ig loci and at TcR loci, involving the 
consensus sequences used for V-D-J somatic 
recombination of active Ig genes. In the other type, 
the translocation occurs at a switching site, so these 
cases may be associated with function of the 
system that 'switches expression from one C H gene 
to another. 

When c-myc is translocated to the Ig locus, its 
level of expression is usually increased. The 
increase varies considerably among individual 
tumors, generally being in the range from 2 to 10. 
Why does translocation activate the c-myc gene? 
The translocation event does not involve fixed sites, 




A chromosomal translocation is a reciprocal 
^ieyent. that exchanges parts of two chromosomes, 
cf relocations that activate the human c-myc 

prpto^noogene involve Ig loci in B cells and TcR 

loci in T cells. 
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hut occurs at a variety of locations within a general 
region on each recomhining chromosome. The 
event has two consequences: c-myc is brought into 
a new region, one in which an Ig or TcR gene was 
actively expressed; and the structure of the c-myc 
gene may itself he changed (but usually not involv- 
ing the coding regions). It seems likely thai several 
different mechanisms can activate the c-myc gene 



ONCOGENES: GENE EXPRESSION AND CANCER i 1199 



in its new location (just as retroviral insertions 
activate c-myc in a variety of ways). 

The correlation between the tumorigenic 
phenolype and the activation of c-myc by either 
insertion or translocation suggests that continued 
high expression of the c-Myc protein is oncogenic. 
Expression of c-myc must he switched off to enable 
immature lymphocytes to differentiate into mature 
B and T cells; failure to turn off c-myc maintains the 
cells in the undifferentiated (dividing) slate. The 
oncogenic potential of c-myc has been demon- 
strated directly by the creation of transgenic mice 
carrying a normal c-myc gene linked to an 
enhancer. Transgenic mice carrying a c-myc gene 
linked to a B lymphocyte-specific enhancer (the IgH 
enhancer) develop lymphomas. The tumors repre- 
sent both immature and mature B lymphocytes, 
suggesting that over-expression of c-mycis tumori- 
genic throughout the B cell lineage. Transgenic 
mice carrying a c-myc gene under the control of the 
LTR from a mouse mammary tumor virus, 
however, develop a variety of cancers, including 
mammary carcinomas. This suggests that 
increased or continued expression of c-myc trans- 
forms the type of cell in which it occurs into a 
corresponding tumor. Specificity of the tumor type 
may therefore depend on the mechanism used 
to activate c-myc; it is not an intrinsic property 
of the gene. 

c-myc exhibits three means of oncogene 
activation: retroviral insertion, chromosomal 
translocation, and gene amplification. The com- 
mon thread among them is increased expression of 
the oncogene rather than a qualitative change in its 
coding function, although in at least some cases 
the transcript has lost the usual (and possibly 
regulatory) nontranslaled leader, c-myc provides 
the paradigm for oncogenes that may be effectively 
activated by increased (or possibly altered) 
expression. 

Every translocation generates reciprocal prod- 
ucts; sometimes a known oncogene is activated in 
one of the products, but in other cases it is not 
evident which of the reciprocal products has 
responsibility for oncogenicity. Also, it is not 
axiomatic that the gene(s) at the breakpoint have 



responsibility; for example, the translocation could 
provide an enhancer that activates another gene 
nearby. 

A variety of translocations found in B and T cells 
have identified new oncogenes. In some cases, the 
translocation generates a hybrid gene, in which an 
active transcription unit is broken by the transloca- 
tion. This has the result that the exons of one gene 
may be connected to another. In such cases, there 
are two potential causes of oncogenicity: the proto- 
oncogene part of the protein may he activated in 
some way that is independent of the other part, for 
example, because it is over-expressed under its 
new management (a situation directly comparable 
with the example of c-myc); or the oilier partner in 
the hybrid gene may have some positive effect that 
generates a gain-of-function in the part of the 
protein coded by the proto-oncogene. 

One of the best characterized cases in which a 
translocation creates a hybrid oncogene is provided 
by the Philadelphia (PH l ) chromosome present 
in patients with chronic myelogenous leukemia 
(CML). This reciprocal translocation is loo small to 
be visible in the karyotype, hut links a 5,000 kh 
region from the end of chromosome 9 carrying c-abl 
to the bcr region of chromosome 22. The bcr (break- 
point cluster region) was originally named to 
describe a region of -5.8 kb within which break- 
points occur on chromosome 22. Different cases of 
CML have breakpoints at different locations within 
this region. 

The consequences of this translocation are sum- 
marized in Figure 59.10. The bcr region lies within 
a large (>90 kb) gene, which is now known as the 
bcr gene. The breakpoints in CML usually occur 
within one of two introns in the middle of the gene. 
The same gene is also involved in translocations 
that generate another disease, ALL (acute lym- 
phohlasticleukemia);in this case, the breakpoint in 
the bcr gene occurs in the first intron. 

The c-abl gene is expressed by alternative 
splicing that uses either of the first two exons. The 
breakpoints in both CML and ALL occur in the 
intron that precedes the first common exon. 
Although the exact breakpoints on both chro- 
mosomes 9 and 22 vary in individual cases, the 
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two types of leukemia. 
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daltons of the usual ~ 145,000 c-Abl protein, that is, 
it has lost just a few N-terminal amino acids of the 
c-Abl sequence. 

Why is the fusion protein oncogenic? It relies 
on an interaction between the N-terminal region 
provided by bcr with the c-Abl protein. The far gene 
has a variety of sequence motifs related to proteins 
involved in signalling pathways, but the pertinent 
one is a serine/threonine kinase activity that is 
coded within the first exon. This aulophosphory- 
lates residues in this part of the protein, and the 
phosphorylation confers the ability to interact with 
a region of the c-Abl protein called the SH2 domain 
(we discuss the nature of SH2 domains later). This 



raises the possibility that the Bcr part of the pro- 
tein interacts with the c-Abl sequences, perhaps 
changing their conformation and activating a 
latent oncogenic potential. 

Changes at the N-terminus are involved in the 
activation of oncogenic activity of v~abl y a trans- 
forming version of the gene carried in a retrovirus. 
The c-abl gene codes for a tyrosine kinase activity; 
this activity is essential for transforming potential 
in oncogenic variants. Deletion (or replacement) of 
the N-terminal region activates the kinase activity 
and transforming capacity. So the N-terminus 
provides a domain that usually regulates kinase 
activity; its loss may cause inappropriate activation. 



Loss of tumor suppressors causes tumor formation . 
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The common theme in the role of oncogenes in 
tumorigenesis is that increased or altered activity of 
the gene product is oncogenic. Whether the onco- 
gene is introduced by a virus or results from a 
mutation in the genome, it is dominant over its 
allelic proto-oncogene(s). A mutation that activates 
a single allele is tumorigenic. Tumorigenesis then 
results from gain of a function. 

Certain tumors are caused by a different mech- 
anism: loss of both alleles at a locus is tumorigenic. 
Propensity to form such tumors may be inherited 
through the germline; it also occurs as the result of 
somatic change in the individual. Tumorigenesis 
then results from loss of function. Such cases iden- 
tify tumor suppressors: genes whose products are 
needed for normal cell function, and whose loss of 
function causes tumors. The two best characterized 
genes of this class code for the proteins KB and p55. 

Retinoblastoma is a human childhood disease, 
involving a tumor on the retina. It occurs both as a 
heritable trait and sporadically (by somatic muta- 
tion). It is often associated with deletions of band 
q!4 of human chromosome 13. The RB gene has 
been localized to this region by molecular cloning. 
Figure 59.11 illustrates the situation. 



Retinoblastoma arises when both copies of the RB 
gene are inactivated. In the inherited form of the 
disease, one parental chromosome carries an alter- 
ation in this region, usually a deletion. A somatic 
event in retinal cells that causes loss of the other 
copy of the RB gene causes a tumor. In the sporadic 
form of the disease, the parental chromosomes are 
normal, and both RB alleles are lost by (individual) 
somatic events. 

Almost half the cases of retinoblastoma show 
deletions at the RB locus. In other cases, transcripts 
of the locus are either absent or altered in length. 
The protein product is absent from retinoblastoma 
cells. The cause of the tumor is therefore loss of 
protein function, usually resulting from mutations 
that prevent gene expression (as opposed to point 
mutations that affect function of the protein prod- 
uct). Loss of RB is involved also in other forms of 
cancer, including osteosarcomas and small cell 
lung cancers. 

What is the molecular function of RB? It interacts 
with a variety of other proteins, including several 
tumor antigens: SV40 T antigen, adenovirus El A, 
human papilloma virus E7. One possibility is that 
part of the oncogenicity of these proteins is due to 



REVIEW ARTICLE Medical Biology 62: 304-317, 1984 
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ABSTRACT 

Regulatory or structural alterations of cellular oncogenes have been implicated in the causation of various 
cancers. Oncogene alteration by point mutations can result in a protein product with strongly enhanced on* 
cogenic potential. Aberrant expression of cellular oncogenes may be due to tumour-specific chromosomal 
translocations that dysregulate the normal functions of a proto-oncogene. Amplification of cellular unco* 
genes can also augment their expression by increasing the amount of DNA template available for I he pro- 
duction of mRNA. It appears that amplification of certain oncogenes is a common correlate of llu- progrcs- 
sion of some tumours and also occurs as a rare sporadic event affecting various oncogenes in different types 
of cancer. Amplified copies of oncogenes may or may not be associated with chromosomal abnormalities 
signifying DNA amplification: double minute chromosomes and homogeneously staining chromosomul 
regions. Amplified oncogenes, whether sporadic or tumour type-specific, are expressed at elevated level*, 
in some cases in cells where their diploid forms are normally silent. Increased dosage of an amplified 
oncogene may contribute to the multistep progression of at least some cancers. 

KEY WORDS: CELLULAR ONCOGENES, GENE AMPLIFICATION, MULTISTEP CARCINOGENESIS, CLONAL SFXECTION 
KARYOTYPIC ABNORMALITIES, DOUBLE MINUTE CHROMOSOMES, HOMOGENEOUSLY STAINING CHROMOSOMAL 
REGIONS 



DNA SEQUENCE AMPLIFICATION AND 
. CYTOGENETIC ABNORMALITIES 
IN TUMOURS 

Since its discovery in drug-resistant eukaryotic 
cells, somatic amplification of specific genes 
has been implicated in an increasing variety 
of adaptive responses of cells to environmental 
stresses {70, 79). Cytogenetic abnormalities, 
double minute chromosomes [dminrs] asso- 
ciated with DNA amplification had already 
been discovered in tumour cells before the 
discovery of dmin:s and homogeneously stain- 
ing chromosomal regions (HSR:s) in cells se- 
lected for drug-resistance (12, 24, 49, 50, 56). 
In metaphase spreads, dmin:s appear as small, 
spherical, usually paired chromosome — like 
structures that lack a centromere (Fig. 2). HSR:s 
stain with intermediate intensity throughout 
their length rather than with the normal pat- 
tern of alternating dark and light bands in 
both trypsin-Giemsa (Fig, 3A) and quinacrine 
dihydrochloride-stained preparations. Both 
kinds of abnormalities are occasionally found 
in metaphases of freshly isolated cancer cells 
but not of normal cells (8). 

Dmin:s and HSR:s are apparently rare in 
tumour cells in vivo, although exact data are 



difficult to obtain since the abnormalities arc 
easily missed in routine cytogenetic analysis 
(8, 42). Dmin:s and HSR:s have been de- 
scribed in most types of in vitro-culturcd malig- 
nant tumour cells, with a notable frequency 
in neuroblastoma cell lines (11). Initial growth 
in cell culture apparently selects for tumour 
cells that contain either dminrs or HSR:s. 
Moreover, in culture dmin:s are frequently 
lost, concomitant with the appearance of 
clonal populations of cells that have devel- 
oped an HSR, suggesting that the two cyto- 
genetic abnormaltities are alternative forms of 
gene amplification and that HSR:s may confer 
a selective advantage on cells over dmin:s (1 1, 
70). It has been assumed that HSR:s can break 
down to firm dmin:s and that dmin:s can in- 
tegrate into chromosomes to generate HSR:s 
[11, 23). Amplified genes may also occupy ab- 
normally banding regions, ABR:s (51, 59). Ex- 
perimental work on drug-resistant cells has 
shown that in the absence of a selection pres- 
sure (drug), dminrs and the amplified genes 
that they contain are lost, whereas amplify 1 
DNA in the form of HSR:s is retained in iw 
cells [71). This is explained by the fact that 
dminrs are segregated unevenly in mitosis an 
frequently get lost from the nucleus due to 
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TABLE 1 
Currently known oncogenes. 
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ONCOGENES FOUND IN RETROVIRUSES 
Retrovirus Oncogene 
{example} 



Gene product 



Cellular location 



Function of protein 



Class 



RSV 

Y73V 

GR-Fe$V 

Ab-MuLV 

FuSV 

ST-and 

GA-FeSV 

UR2V 



sre Plasma membrane 

yes Plasma membrane 

fgr Plasma membrane 

abl Plasma membrane 

fps/fes Cytoplasm 

(plasma membrane?} 
fes/fps Cytoplasm 

jcytoskeleton?) 



Tyrosine-specific 
protein kinases 
[fgr contains sequences 
homologous to actin) 



Class la {Cytoplasmic 
tyrosine protein kinases) 



AEV 



SM-FcSV 



| MH-2V 
! 39H-MSV 
; Mo-MSV 



crb-B 



fms 



mi)/raf 
raf/mi! 
mos 



Plasma membrane 
and cytoplasmic 
membranes 
Plasma membrane 
and cytoplasmic 
membranes 
Cytoplasm 
Cytoplasm 
Cytoplasm 



EGF receptor's cyto* 
plasmic domain 

Cytoplasmic domain of 
a growth factor receptor? 

? 

? 
? 



Class lb (Class la-related 
proteins) 





sis 


Secreted 


PDGF-like growth factor 


Class 2 (Growth factors) 


■ Ha-MSV 
j Ki-MSV 


Ha-ras 
Ki-ras 


Plasma membrane 
Plasma membrane 


GTP -binding proteins 


Class 3 {Cytoplasmic 
GTP:ases) 


'J FBJ-MuSV 
1 OK-10V 
i AMV 


fos 

myc 

myb 


Nucleus 
Nucleus 
Nucleus 


? 

Nuclear matrix protein 

? 


Class 4 (Nuclear phospho- 
proteins) 


i SKV 770 
1 REV 
j AEV 
E26V 


ski 
rel 

erb-A 
et$ 


Nucleus? 

? 

7 

? 


? 
? 
? 
? 


Unclassified 


J ONCOGENES FOUND IN TUMOUR CELLS BUT NOT IN RETROVIRUSES 
1 Tumour cell 


i Neuroblastoma 
) Neuroblastoma 
( Small-cell lung cancer 
] Meuro/GHoblastomas 


N-ras 
N-myc 
h-myc 
neu 


Plasma membrane 

? 

? 

Plasma membrane 


GTP-binding 

? 
? 

Growth factor receptor 


Class 3 
Class 4 
Class 4 
Class lb 



their lack of centromeres, (49). HSR chromo- 
somes carry centromeres and are therefore 
divided equally at mitosis. If dminrs and 
HSR:s contain amplified genes that encode 
growth-stimulating protein products, it would 
follow that the more stable chromosomal 
form, the HSR, confers a greater selective 
growth advantage for cells. Although dmin:s 
and HSR:s have been described predominant- 
ly in tumour cells selected for resistance to cy- 
totoxic drugs, it is also clear that dmin:s and 
HSRrs may be present in cancer cells before 
*ny form of therapy (8). It was in this setting 
that we and others first chose to explore the 
Possible amplification of cellular oncogenes. 
'By definition, cellular oncogenes are normally 



innocent genetic loci which can be activated 
to transforming genes in various ways). 1 



DMIN:S AND HSR:S CONTAIN AMPLIFIED 
ONCOGENES 

Table 2 summarizes the somatic amplifica- 
tions of cellular oncogenes so far reported in 

1 It is not the purpose of this review to deal with all forms 
of DNA damage that have been found to activate cel- 
lular oncogenes. For the purpose of integrating the re- 
view into a coherent picture, however, the reader is 
given a list of known cellular oncogenes in Table 1 and 
the schematic Figure 1 illustrating the various ways in 
which the oncogenic potential of different proto-onco- 
genes can be activated. 
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Fut 1 Activation of cellular oncogenes. The haploid complement of a proto-oncogene is schematically depicted in A 
composed of three exons (black boxes) in a segment of DNA. The different activated forms are schematically outlined 
tn7-G. The abbreviation c-okc stands for cellular oncogene, v-onc viral oncogene. DN A se quences wUh as *oc.ated 
strong promoter/enhancer functions are striated, and an actively transcribed gene is marked with radiations. B. Acute 
? ansformm? retroviruses have the capacity to transduce cellular oncogenes (c-okc) into the.r genome, mod. y them 
XiS their activated oncogenes (voLl into the genome of host animal cells as a part of the.r provira forms. 
The ac«vUy o?the v o«c gene is greatly enhanced due to the associated promoter of the provual long termmal repea . 
BothTncrea y S ed dosage of the oncogene and structural mutations within its sequence may contntmte to t™»ngn«* 
d Slow transforming^ retrovirusestithout oncogenes replicate and reinsert the.r proviral cop.es into the host cell DNA 
during latency period from infection to tumorigenesis. Tumor initiation through n yP er P las ^^^ 
when the provi™ integrates sufficiently close to a proto-oncogene to activate it through P romol ( ! r ^nchancer func- 
tons It should be noted, however, that mutations have also been found in the oncogenes thus activated and that muta- 
tional damag Tto other oncogenes has been described in the resulting tumors. D. n some ^P^^been 
r°trovinis-like DNA elemenf (directing .he synthesis of the so-called intracistemal - h . a ^ of 
found in association with a transcriptionally activated oncogene c-mos. The IAP insertion also disrupts the 5 i part of 
c mos (64) E. In humans, as well as in animals, chromosome translocations may place proto-onoogenes nto tnmenp- 
Uo^lly active regions of chromatin, where they may be activated. The details of this mechan.sm have not been wo ked 
out but It is believed to occur for c-myc and cibt genes in Burkitt lymphomas and Philadelphia-chromosome pos we 
teukemias reSMCtive V 135 40i: F. Increased amounts of oncogene-specific RNA and protein can also result from an 
exc£s Tofb^temSe for transcription acquired through oncogene amplification. The V^^™£"»*- 
Primarily on this mechanism. G. Mutationally activated oncogenes have been found m nearly one fifth ,of h«^n malg 
nant tumours. Oncogene loci activated by somatic structural mutations are reveled by transfecUon ^P"'™"' 5 ,; 
where thev are introduced into genetic background of nontumourigenic cultured immortalized cells Several such 
taansfo S I tec have been cloned and man/of them belong to the c-ros oncogene family. It : should be , pointed I out 
ha" b^h structural mutations and either increased expression or activation of a complementing oncogene may be 
required to achieve a fully tumorigenic phenotype (44). 



tumour cells. Although the sampling of tu- 
mours is at present small, the finding of 
known cellular oncogenes among amplified 
DNA represented by dmin:s and HSR.s of 
cancer cells is provocative. Amplification has 
been found to affect at least five out of twenty 
known cellular oncogenes and the degree of 
gene amplification varies from five to many 
hundred-fold over the single haploid copies 
found in normal cells (see also ref. 18). The 
first amplification reported involved the c-myc 



oncogene (see Table 1) in a promyelocyte leu- 
kaemia cell line HL-60 (20. 25). The degree of 
c-myc amplification is between 8—32 fold 
both in the HL-60 cell line and in primary leu- 
kaemic cells from the patient (20. 25). Origi- 
nal clonal lines of HL-60 were later found to 
contain some dmhv.s in culture but their num- 
ber was insufficient to establish any clear cor- 
relation with amplified c-myc. Such a correla- 
tion, however, was discovered for c-myc am- 
plification in a neuroendocrine cell line from 
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GIEMSA 
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CHROMO- 
SOMES 



i'c 



DMIN.'S 



*l COLO ™m! T™* ? hromt *°mes (arrowheads) in 
' resold 20DMco ' on ^rcmoma cells. A. The drain* are 
> ffi S d< ? ,s amon 8 normal chromosomes in 

I Purin r"^ ^""^ole-slained preparation B-D. 



Purir; .• , mQazole slainea Preparation B-D. 

, ^nr Ica , 10n of dmra:s by differential centrifugations. B. 

1 f M w g ^ aterla1 ' C Chr °™some fraction. D. Puri- 
j! "«l d m ,n:s (Donna George and the author, unpublished 



data and ref. 52J. 



ceiu carcinoma . COLO 320 (5). In these 
«»s, the approximately 30-fold amplified 
Wye copes were mapped either to HSR:s of 
' marker chromosome (5, Fig 3BJ or to dmin:s 
^(. depending on the particular subline stud- 
Dri m = nC f mm ' S T le alread y P r «ent in the 
TT, C , e ' ,S fr0m this colon c «cino- 

•ftD&J W 617 th f °/^ C had a,so been 
JP hfted during growth of the tumour in vivo. 

iCem, ?* am P llfied copies of the c-Ki-ras on- 
»mo ue T mappe . d t0 dmin:s and HSR:s of 
Cve c e ^^° rt ' cal turnor « (74). An ex- 
'«d h,n^ »° r L cha ^ ges m other oncogenes 
C K l Ce "f h u S SinCe revealed amplifica- 
fl s that do not show up as dmin:s or HSR:s. 




B 



thl c h^;Sf use ° gen , e0US l. y stainin 8 re 8 ions l HSR ) >" 
me G-banded HSR-marker chromosome comprise a ma- 

ZXTT ° f botb '? lon 8 and sIwrt a™™ The HSR- 
= hr0I ? 0 . s ? m J e 5 as ev <>lved from an X-chromosome 

m COLO \? n ampl, r Cd ? pi t S ? the onc °8 e ne 
HSR. t»? ? llswere found 10 °e located to drain* and 
niK.s. The latter is shown here by in silu-hybridization 
(S, 52). 



Thus for example, the c-myb oncogene is 
amplified m a characteristic marker chromo- 
S0 ^l™ a colon car cinoma without evidence 
of HSR:s (ref 6, Fig. 4] and in other tumours, 
the amplified c-abl and c-wiyc oncogene loci 
map to abnormally banding regions (ABRrs) in 
translocated or resident chromosomal seg- 
ments, respectively (59, 76). 

TRANSLOCATIONS AND 
REARRANGEMENTS MAY ACCOMPANY 
ONCOGENE AMPLIFICATION 

The evolution and progression of the karyo- 
!£?V tumour ce "s is complicated (see ref. 
08). Concomitant with amplification, DNA se- 
quences acquire an increased mobility in the 
genome with extrachromosomal intermediates 
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TABLE 2 

Sporadic and tumour-specific amplification of cellular oncogenes. 



Tumour cells 



Onco- 
gene 



Fold 

amplified 



Chromosomal 
location of 
amplified gene 



Expression 
elevated 



Sporadic: 

HL60 {acute promyelo- c-myc 20 x 
cytic leukaemia, M3} 

COLO320 (colon c-myc 30 x 
carcinoma) 

Yl (adrenocortical c-Ki-ras 50 x 
tumour) 

COLO201/205 (colon c-myb 10x 
carcinoma) 

K562 (chronic myelo- c-abl lOx 
genous leukaemia, CML) 



8q(ABR) 
dmin, HSR 
dmin, HSR 
marl 

mar(ABR) 



A431 (epidermoid 
carcinoma) 



cerbh 15—20 x n.d. 



ML1— 3 (acute myeloid c-myb 5— 10 x 
leukaemia, M2) 



SK BR-3 (breast 
carcinoma) 

SEWA (polyoma virus- 
induced mouse tumour) 



c-myc 10 x 
c-myc 30 x 



Lu-65 (lung giant cell 
carcinoma) 

Primary leukemic cells 
from an acute myeloid 
leukemia (M2) patient 



c-myc 8 x 
c-Ki-ras 10 x 
c-myc 33 x 



n.d. 

n.d. 
n.d. 



n.d. 
n.d. 
n.d. 



Tumour-specific: 

small-cell lung cancer c-myc up to 80x n.d. 



Neuroblastomas 
Glioblastomas 



L-myc, 
N-myc 

N-myc up to 250 x dmin, HSR 
c-er&B — — 



Yes 
Yes 
Yes 
Yes 

Yes 
Yes 

Yes 

Yes 
Yes 



n.d. 
n.d. 
n.d. 



Yes 



Yes 



Remarks 



References 



Amplification present in 20, 25. 59 

primary leukaemic cells 

Part of the amplified 4, 5, 52 

c-myc sequences rearranged 

Levels of p 21 eKi '<u 74 

protein elevated 

Patient treated with 4, 6,88 

5-fluorouracH prior to 
culturing of the tumour cells 
C x coamplified in the 21, 22, 41, 

marker that may be derived 54, 76 
from chromosome 22, c-abl 
protein-associated tyrosine 
kinase activated 

Amplification linked to 82 
chromosome 7 translocation 
and sequence rearrangements 
Amount of protein product, 
the EGF receptor, elevated (see 36) 
Abnormalities of chromo- 34. 61, 91 
some 6q22— 24, where 
c-myb is normally located 

43 

Cells have dmin:s depending Manfred 
on culture conditions; Schwab, 
c-myc amplification personal 
correlates with growth commu- 
as a tumour nicalion 
At least some copies of 80 
c-Ki-ras mutated ' 

Unpublished | 

data of the . 

author and J 

A. de la • 

Chapel le I 

Most amplifications in the 53, 69 I 
variant phenotype of SCLC 

N-myc also amplified in 14, 48, 72, 
primary tumours of 73„ 75 

advanced grade 

Rearrangements also found Josef 
personal communication Schlessin- 
ger, per- 
sonal com- 1 
municalion 



n.d. = not determined, mar - marker chromosome, M2, M3 refer to the French-American-Briltish classification of 
acute myeloid leukemias. 

* At least one case of oncogene amplification in normal germ-line cells has been found (18). 



visualized as dmin:s, transpositions and trans- 
locations to other chromosomal segments, etc. 
(see 70 for references). There may not be pre- 
ferred chromosomal sites for the apparent 
reintegration of dmin:s as HSR:s (75|. In at 



least one case, however, an oncogene may 
have been caught amplifying in situ in its resi-. 
dent chromosomal site (59). The finding of 
moderately amplified oncogenes also in chro- 
mosomal sites lacking HSR:s suggests thai 
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marl u_ cr 

kbp up 




A B 



Fig. 4. Localization of amplified c-myb in COLO 201/205 
cells by in silu hybridization. Shown is a characteristic, 
large marker chromosome (marl) with G-banding (A) 
and associated c-myb autoradiographic grains [BJ. Note 
the absence of HSR:s. Marl has probably evolved from 
chromosome number 6, the resident site of the c-myb on- 
cogene In normal cells (34, 88, 91). (Robert Winqvist and 
the author, unpublished data}. 



(onco)gene amplification may be more com- 
mon than the structural alterations shown by 
chromosome banding and microscopy (6, 88). 

In at least three cases reported amplification 
has been accompanied by a DNA rearrange- 
ment of the oncogene (5, 20, 82). In the colon 
carcinoma COLO 320 both damaged and nor- 
mal versions of the c-myc gene are amplified 
(5). Although individual cell clones have not 
yet been examined, our unpublished experi- 
ments suggest that the same dmin-containing 
cells harbor and express both normal and re- 
arranged forms of c-myc. The normal version 
?fthe amplified gene, however, predominates 
to COLO 320 cells containing HSR:s; the rear- 
ranged version is present only in what ap- 
pears to be a single copy (Fig. 5). In the 
cnronic myeloid leukaemia (erythroleukae- 
m »a) cell line K562 an amplified DNA segment 
consists of portions of both the c-abl onco- 
pne and the immunoglobulin locus (76). 
,n both cases abnormal transcripts are prod- 
ded from the rearranged amplified oncogenes 
£ l g. 6 and ref. 22). In K562 cells, the abnor- 
mal c-abl oncogene product has also been acti- 
ated iis a tyrosine protein kinase (41). It is not 
Known, however, whether structural alter- 
*!° ns of the genes preceded amplification or 

aether they were acquired during the process 
' gene amplification. It seems likely that a 
, nr omosomal translocation of c-abl to the Q 
Ksro P receded DNA amplification in the 
oo2 cells, since all amplified copies were al- 

cem ea r ranged 121,1 with the chan 8 e 

reminis- 

"tot the Philadelphia translocation (t(9, 22)) 
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Fig. 5. Amplification and rearrangement of c-myc in 
COLO 320 cells. 10 fig of cellular DNA was digested with 
Sst I, electrophoresed, blotted and probed with a v-myc 
Pst I fragment (ref. 2, left panel). Fragments of 2.7 kbp 
and 1.4 kbp are seen in both normal and amplified c-myc 
DNA. The 3.3 kbp fragment is derived from a DNA seg- 
ment of unknown origin translocated to the 5' region of 
c-myc with a concomitant deletion of its first exon (unpub- 
lished data of Manfred Schwab and the author). HSF, 
human skin fibroblasts; DM, COLO 320 DM cells; HSR, 
COLO 320 HSR cells. Different amounts of DNA from 
COLO 320 DM cells as indicated were mixed with calf 
thymus DNA to give 24 jig of total DNA, cleaved with 
Sst I. electrophoresed, blotted and probed with a frag- 
ment of 3 'human c-myc sequences. The intensities of the 
2.7 kbp c-myc fragment in different samples were com- 
pared to assess its copy number, estimated to be about 
30 j5). 



found in most chronic myeloid leukaemia tu- 
mours (35, 66—68). Although they have not 
been sequenced, other reported cases of am- 
plified oncogenes are apparently normal on 
basis of mapping with restriction endonu- 
cleases (see Table 2). Therefore we cannot at 
present view mutation as a necessary com- 
panion of oncogene amplification. 



THE MECHANISMS OF GENE 
AMPLIFICATION 

The mechanisms of gene amplification and 
the structure of the amplified DNA have been 
worked out mainly in experimental settings 
involving selection for drug-resistance in cell 
culture (70). Although the mechanisms are 
still incompletely known and may vary in dif- 
ferent cases, some general features have 
emerged. 

A spontaneous degree of illegitimate DNA 
replication seems to exist in normal cells so 
that various segments of DNA are replicated 
more than once during a single cell cycle (37). 
In unselective conditions this DNA is probab- 
ly lost e.g., through formation of micronucleae 
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Fig. 6. Comparison of the electrophoretic mobilities of 
c-myc mRNArs from COLO 320 DM and HSR cells. The 
size of the normal c-myc mRNA is 2.3 kb. The rearranged 
c-myc locus in DM cells (see Fig. 5) seems to be pre- 
dominantly expressed giving rise to a shortened RNA. 



because the newly synthesised extra copies of 
DNA are not covalently linked to chromo- 
somal DNA of mitotic cells (65, 71). If, how- 
ever, there is a selection pressure to retain an 
increased gene dosage, progressive multipli- 
cation of gene copy number results. The in- 
cidence of cells bearing amplified genes under 
conditions of cytotoxic selection can vary by 
two orders of magnitude and is greatly in- 
creased by the presence of mitogenic sub- 
stances (hormones or tumour promoters) dur- 
ing selection (10, 84, 85) or certain carcino- 
genic or cytotoxic agents before selection (15, 
55, 79, 80, 81, 85). An interesting hypothesis 
suggested by Varshavsky (84, 85) supposes that 
the origins of DNA replication "fire" (initiate 
replication) illegitimately several times during 
a single cell cycle and that this kind of "repli- 
con misfiring" may be increased by substances 
such as tumour promoters and mitogenic hor- 
mones (10, 84, 85). Mariani and Schimke (55) 
point out that most of the cytotoxic agents that 
increase the incidence of gene amplification 
are inhibitors of DNA synthesis. Aberrant 
replication is known to take place after tran- 
sient inhibition of DNA synthesis and this re- 
sponse can lead to gene amplification (46, 47, 
55, 90). Mitogenic hormones probably increase 
disproportionate DNA replication, but they 



also enhance the colony forming efficiency of 
drug-resistant ceils in selective conditions (10). 

According to the studies of Axel and his col- 
laborators (65), the multiple cycles of un* 
scheduled DNA replication at a single locus 
during a single cell cycle result in a structure 
schematically outlined in Fig. IF. The hydro- 
gen-bonded amplified copies of DNA depicted 
in Fig. IF must resolve into a tandem linear 
array before the next mitosis. This may well 
occur by homologous recombination between 
any one of several repeated sequences within 
the amplified domain (45, 65(. Part of the re- 
combinations would lead to extrachromosomal 
circles possessing an origin for replication (16, 
62); these could be the precursors of dmin:s. 
The unequal recombinations mean that the 
resolved linear structure consists of tandemly 
repeated but heterogeneous units. According 
to Axel's model a gradient of amplification is 
formed so that centrally located sequences are 
amplified more than sequences distal to the 
origin of replication (65). This also has, in fact, 
been found to explain the large, complex DNA 
domain amplified in neuroblastoma cells in 
vivo (38, see also below). 

The chromosomal site of integration of 
transfected genes significantly affects the fre- 
quency and cytogenetic result of their experi- 
mentally induced amplification (83). The 
amplification frequency in some transfor- 
mants has been found to be 100-fold that of 
the others (83). This suggests that there also 
are preferred chromosomal positions for am- 
plification of host cellular genes and that chro- 
mosomal rearrangements may facilitate gene 
amplification by positioning chromosomal se- 
quences in a favorable array. In respect of the 
structural properties of the sequences involved 
in gene amplification, recombinatorially active 
regions have been implicated in experimental 
cases, DNA rearrangements involving restric- 
tion fragment length polymorphisms and 
variation in gene copy number have been de- 
tected in the human genome between clusters 
of short repetitive interspersed DNA sequences 
called Alu family DNA-sequences (17). Such 
inter-Alu sequences have also been detected 
in an extrachromosomal DNA form, including 
covalently closed circles (17, 78). The copy 
number of inter-Alu sequences apparently va- 
ries in an age- and tissue-specific manner (17, 
78), but any comprehensive analysis of the 
phenomenon in human tumours is not yet 
available. It is also not yet clear whether these 
kinds of repetitive sequences are involved in 
generating amplified oncogene sequences in 
dmin:s or HSR:s in tumours. 
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Fig 8 A. Indirect immunofluorescence for p21^« in Yl 
DM cells. Similar fluorescence of the plasma membrane 
was obtained for the Y l HSR cells. Inset (b| s hows xontro 
staining with normal rat serum and mset (c) JtmiuogJ at 
normal rat kidney cells with the monoclonal antibody 
against p21. 



NRS anti-p21 



Fig. 7. Elevated levels of the pZl«*« protein in Yl cells 
(741 The Yl DM and HSR cells which harbor a 50-fold 
amplified c-KWas oncogene (74) and control cells were 
labeled with ['^methionine and the -j£l*"T prolem 
was immunoprecipitated, as detat ed Z 4 !-^^^™ 
rat serum (NRS) or rat monoclonal anti-p21 serum. The 
proteins were electrophoresed in a 15 % polyacrylamide 
gel in the presence of SDS. In addition to a major p21 

. band, a labeled band at about 16 kd was present in the 
^immunoprecipitates. The amount of radioactivity in p21 

T was about 50 fold that in normal rat kidney eel s. The 
t Kristen sarcoma virus-transformed rat kidney cells [ob- 

I tamed from the American Tissue Culture Collection) also 
•yielded unexpectedly low amounts of the v-Ki-ras 

[t protein. 



CARCINOGEN-INDUCED GENE 
'AMPLIFICATION AND CLONAL 
SELECTION OF CANCER CELLS 

Although cell sorting experiments have 
..shown a basal spontaneous rate of gene ampli- 
fication in eukaryotic cells (37), this can be 
6 ^creased severalfold by metabolic inhibitors 
* cytotoxic agents (15, 37, 70, 81, 85). In 
. toany respects the latter response is reminis- 
cent of the so-called SOS-response elicited in 
Weriae by noxious stimuli (see 28). In a te- 
kologkal context, the rapid induction of gene 
Amplification that apparently occurs frequent- 
ly through extrachromosomal intermediates 
m ay provide cells with genetic material for 
^bsequent selective pressures operating in 
Armful conditions (60}. In cancer cells, the 
Mechanism may enhance the emergence of 



clonal populations of cells with increasingly 
malignant properties (58, 60). Such genetic 
instability of cancer cells is clearly enhanced, 
leading to the rapid evolution of increasingly 
malignant tumour cell populations (19, 58). A 
serious question of practical importance is 
whether drug resistance in treated patients 
also selects cells that have an enhanced ability 
to amplify (onco)genes important for growth 
and progression of the tumour (84, 85). It is 
also possible that some of the carcinogenic in- 
sults caused by mutagens are only expressed 
as a result of subsequent amplification events 
induced by tumour promoters (84, 85) or facil- 
itated by hormones in, say replicating epithelial 
cells (10). The persistence of dmin:s in some 
tumours suggests that there is a selection 
pressure for their retention (8, 9, 11, 23). 
Amplified DNA in dmin:s must contain an 
origin for DNA replication (62) and must be 
selected for in daughter cell populations, 
where it is unevenly segregated (71). In the 
absence of such a selection pressure dmm:s 
are lost (71). In at least one study the length of 
an HSR has been found to increase during a 
selection of malignant cells for enhanced 
tumourigenicity (30). 

The amplified c-erbB gene in A431 cells codes 
for epidermal growth factor receptor (27). The 
abundant amounts of receptor protein on 
A431 cell surface may, however, provide the 
cells with an abnormal growth response (31). 
A naive supposition is that the amplified se- 
quences in dmims and possibly in HSR:s of 
tumours contain growth-promoting genes (see 
36 for references). This seems to fit well with 
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Fig. 9. Construction of a v-myc expression vector. A synthetic linker {ATG AGAATTCATCATJ containing a translational 
initiation codon was inserted downstream from the Up promoter in the pSRC ex 16 Rl expression vector described 
previously (see ref. 3). Approximately one "half of the v-src sequences coding for the aminotermmal portion of pp60 fc " ,w 
protein were then deleted and the remaining portion ligated in translational codon frame with the synthetic ATG. A 
Hinc II fragment of v-myc from plasmid clone MC 38 (nucleotides 320—685 in the v-myc sequence in ref. 2) was ligated 
downstream form remaining v-src sequences in continuity with its reading frame. The resulting product contained 3 
amino acids from the synthetic linker, 252 amino acids encoded by the 756 base pair fragment from Sma I to Pvu H 
restriction sites in v-src DNA. 122 amino acids from the v-myc and 6 amino acids (corresponding to nucleotides 

2968-2085) from the pBR322 vector (3). 
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| r ecent findings on amplified oncogenes, though 
j i n many cases the search for an amplified on- 
cogene is still continuing. Even positive find- 
j i n gs do not mandate a role for amplified eel- 
Mular oncogenes, however, because the do- 
main of amplified DNA is inevitably much 
larger than a single genetic locus [e.g. 38). 



ENHANCED EXPRESSION OF AMPLIFIED 
ONCOGENES 

In all cases where they have been studied, the 
amplified oncogenes have been found abund- 
antly expressed at the RNA level, roughly in 
proportion to the amount of DNA amplifica- 
tion (see Table 1). Described cases of elevated 
RNA expression include examples of abnor- 
mal (5, 22) and ectopic (6) transcription. In at 
least four cases this enhancement is not li- 
mited to synthesis of RNA (31, 33, 41, 74, 82). 
The Yl cells that have amplified c-Ki-ras con- 
tain exceptionally large amounts of its protein 
product situated on the plasma membrane 
(ref. 74, Fig. 7 and 8). High amounts of the 

I c-myc encoded protein are also found in 
COLO 320 cells that have amplified the gene 

} (33). The myc oncogenes have recently been 

1 shown to encode nuclear proteins (ref. 1, 3, 
26, 29, 32, 33, Fig. 9-11). Both the expression 
of the c-myc mRNA (39) and the subcel- 
lular localization of myc proteins are linked to 
the cell cycle (ref. 89, Fig. 12). It may be that 
elevated expression of specific c-myc func- 
tions is necessary for cell cycle progression 
and the growth transformation aspect of the 

| phenotype of cancer cells that may contribute 
to tumour progression (7, 36). Elevated ex- 

j pression of c-myc has been shown to replace 
J» part platelet-derived growth factor in in- 
duction of competence for DNA replication 
Generally, enhanced expression of an on- 
cogene could be a necessary prerequisite for 
acquisition of a growth advantage by cells 
having extra copies of the gene. This effect 



+ + + INDUCTION 

cq<0 a o cocoo 



43.000+ 



1 IB SI 1 
i i ill 



MW 
x10" 

93 
68 

+ 46 

30 
20 



Fig. 10. & coli 294 was transfected with the hybrid v-src, 
v-myc plasmid outlined in Fig. 9 and ampicillin-resistant 
bacterial colonies were checked for the production of a 
43,000 m.w. bacterial v-myc protein after induction by 
growth to different optical densities in minimal essential 
medium |M9, induction +) or complete medium (LB, in- 
duction—) (3). 



could also be the principal contribution of 
amplification to tumourigenesis. 



TUMOUR CELL AND STAGE SPECIFICITY 
OF ONCOGENE ACTIVATION AND 
AMPLIFICATION 

Tumour cell specificity of oncogene amplifi- 
cation has been found in three malignancies. 
The c-myc, L-myc or N-myc oncogene is ampli- 
fied in most cases of the variant form of small- 
cell lung cancer cells (53, 69), c-erbB is ampli- 
fied in several glioblastomas (Josef Schlessin- 
ger, personal communication) and the putative 
N-mvc oncogene is amplified in about half of 
grade III and IV neuroblastomas (14, 72, 73, 
75). In addition to HSR:s, small-cell lung can- 
cers and neuroblastomas frequently show a 
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Fig. 12. Fluorescent staining for DNA and myc protein in 
myeiocytomatosis virus-transformed quail cells. In in- 
terphase cells, the myc protein is confined to the nucleae. 
In the mitotic cell, myc fluorescence is distributed 
throughout the cell unlike fluorescence for chromatin, 
which is compacted to chromosomes in the metaphase 
plate. In fact, there is less myc fluorescence associated 
with chromatin than with the rest of the cell. DAPI, dia- 
minophenylindole DNA stain. The anti-myc protein rab- 
bit antiserum was used in a 1/200 dilution (ref. 89). 



deletion of a portion of the short arm of chro- 
mosome 1 (13) and chromosome 3 (86, 87), 
respectively, in karyological examination. 
Two kinds of changes have also been described 
in different neuroblastoma oncogenes. The 
first is a mutation in the N-ras gene, an ac- 
tivated oncogene that was discovered because 
of its relation to other ras genes and trans- 
forming activity in transfection .experiments 
(77). The second is amplification of a distant 
homologue of the c-myc gene called N-mvc (72, 
73, 75). Although the transforming potential 
of the N-myc gene has not yet been established, 
its consistent presence in a core segment of 
amplified neuroblastoma DNA (38, 57, 72, 73, 
75) and its elevated expression in most reti- 
noblastomas (48) suggests its oncogenic nature. 

Taya et al. (80) have recently described a 
human lung giant cell carcinoma grown in 
nude mice, where both c-Ki-ras and c-myc on- 



cogenes were amplified about 10-fold. Besides, 
sequencing studies indicated that at least 
some of the amplified c-Ki-ras copies were 
also mutationally activated in the 12th codon. 
These results fit to the multistage theory of 
cancer development and progression (see 58). 
Apparently co-operating lesions in cellular on- 
cogenes accumulate during tumour growth 
and selection and increase the malignant po- 
tential of the tumour cells (44). 

When does oncogene amplification come in- 
to play during tumourigenesis? Gene amplifi. 
cation may not be any initiating event in carci- 
nogenesis. Amplification and enhanced ex- 
pression of c-myc and N-myc may occur during 
the progression of human carcinoma of the 
lung and neuroblastoma cells to a more malig- 
nant phenotype (14, 53, 73). There may be, 
however, no mandatory sequence of onco- 
gene amplifications for the genesis of any par- 
ticular tumor. Amplification of an oncogene 
could play its part in malignant progression of 
already initiated cells whenever it happened 
to occur. 
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Abstract 

The human epidermal growth factor (EGF) receptor is known 
to be homologous to the \-erb B oncogene protein of the avian 
erythroblastosis vims. Overexpression of the EGF receptor 
gene in A431 epidermoid carcinoma cells is due to gene 
amplification. In this study, a variety of squamous ceil carci- 
nomas were examined and one, SCC-15, contained high levels 
of the EGF receptor as determined by immnnopretipitation via 
an EGF receptor-specific polyclonal antibody. Using a cloned 
EGF receptor complementary DNA as a probe, the level of 
EGF receptor RNA was found to be elevated four-fold in SCC- 
15 relative to normal cultured keratinocytes. When the same 
probe was used to identify EGF receptor gene fragments on a 
genomic DNA blot, the SCC-15 cell line was shown to possess 
an EGF receptor gene copy number amplified four to five 
times. Gene amplification results in the enhancement in the 
level of the EGF receptor in several carcinomas and could be 
responsible for the appearance of the transformed phenotype 
in these cells. 

Introduction 

The epidermal growth factor (EGF) 1 stimulates growth and 
elicits a wide variety of rapid and delayed responses by binding 
to high-affinity cetl-surfacc receptors which are 170-kD glyco- 
proteins (I). Recently, EGF receptor peptides have been se- 
quenced and found to be homologous to the avian erythro- 
blastosis virus erb B oncogene product (2), suggesting that the 
EGF receptor gene is the human oerb B oncogene. A431 
epidermoid carcinoma cells possess a very large number of 
EGF receptors (3), and the EGF receptor gene is amplified 
~30-fold (4-6). This amplification is responsible for the 
overexpression of the EGF receptor protein in these cells 
(4-6). 

A cell culture system has been developed permitting serial 
cultivation of keratinocytes, whose growth is modulated by 
EGF (7). Such methods have been used to establish cell lines 
from squamous cell carcinomas of the oral epithelium (8). 
Because of the role of EGF in keratinocyte development, we 
quantified EGF receptor protein and RNA in several squamous 
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cell carcinomas. One cell line, SCC-15, was found to contain 
high amounts of receptor protein and RNA, and a four- to 
fivefold amplification of the gene. 

Methods 

The squamous cell carcinomas established by Rheinwald and Beckett 
(8) were obtained from, and maintained according to the American 
Type Culture Collection (Rockvjlle, MD). 1623 was originally designated 
as SCC-15; 1628 as SCC-25; and 1629 as SCC-9 (8). Normal human 
esophageal epithelial cells were grown as reported (9). Maintenance of 
other cell lines was as described elsewhere (10). Proteins were labeled 
with { 34 S] methionine and immunoprecipitated as previously described 
(10). PolyA+ RNA was isolated by guanidine isothiocyanate solubili- 
zation and CsO centrifugation (II), and oligo(dT>affinity chromatog- 
raphy. RNA (Northern) blotting was performed as described (II, 12). 
High molecular weight DNA was isolated (4) and analyzed by DNA 
(Southern) blotting (4, 10, 13). The EGF receptor complementary 
DNA (cDNA) clone pE7 was constructed and isolated from an A431 
cDNA fibraiy (II). DNA fragments were ^-labeled by nick translation. 

Results 

A large number of cell lines were initially screened for EGF 
receptor levels by determining their ability to be killed by an 
EGF-pseudpmonas exotoxin conjugate, a technique described 
previously (10). Several squamous cell carcinomas were found 
to be relatively sensitive to the EGF-toxin conjugate, including 
SCC-25, SCC-9, and particularly SCC-15, all derived from the 
human tongue (8), These three cell lines were labeled with 
["SJmethionine, and their extracts immunoprecipitated with 
a goat polyclonal antibody to the EGF receptor, affinity- 
purified as described (10). When compared with A431 cells, 
which make very large amounts of the EGF receptor, SCC-25 
and SCC-9 make moderate amounts and SCC-15 high amounts 
of the receptor (Fig. 1, lane a vs. e. g. and c). Quantitation of 
the immunoprecipitation data revealed that SCC-15, SCC-25, 
and SCC-9 make 41, 15, and 4% of the amount of EGF 
receptor made by A431 cells, respectively. 

Because SCC-15 cells had high levels of receptor, polyA+ 
RNA was isolated from these cells, electrophoretically fraction- 
ated on agarose, and analyzed by RNA (Northern) blotting. A 
cloned A431 cDNA (pE7) encoding the EGF receptor (11) 
was 32 P-labeled and used as a hybridization probe to visualize 
EGF receptor RNAs. Fig. 2 A shows that SCC-15 contains 
both the 10- and 5,6-kilobase species of EGF receptor RNA. 
The levels are approximately four- to fivefold higher than 
those found in either KB or A498 kidney carcinoma cells; 
these cell lines were previously found to possess readily de- 
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tectablc levels of both receptor RNAs (10). Fig. 2 B shows that 
cultured human epithelial cells contain EGF receptor-specific 
RNA (HE1A, lane e) whose levels axe higher than an early 
passage human fibroblast D551 (lane/), equivalent to A498 
(lane g), but much lower than SCC- 15 (lane b). 

To determine if an elevated gene copy number was asso- 
ciated with enhanced expression of the EGF receptor gene in 
SCC- 15 cells, genomic DNA was isolated from normal cultured 
epithelial cells (HEIA) and SCC- 15 cells, digested with tfrodlll, 
electrophoreticaHy fractionated, and subjected to DNA blotting 
analysis. An EGF receptor cDNA (pE7) was used as a hybrid- 
ization probe to identify receptor DNA fragments. Fig. 3 A 
reveals that the SCC- 15 genome contains four- to fivefold 
amplified EGF receptor gene sequences relative to normal 
epithelial cells (lane a vs. b). Analysis of 0-actin gene fragments 
on the same filter by hybridization to a chick actin cDNA 
probe indicated that equal amounts of DNA were loaded per 
well (data not shown). Digested SCC- 1 5 DNA had to be 
diluted about fourfold (Fig. 3 B, lane e) to approximate the 
signal intensity of receptor DNA fragments from SCC-25, 
SCC-9, and KB cells (lanes g-i). The KB cell EGF receptor 
gene is known not to be amplified (10). 

Discussion 

It may be significant that A43 1 carcinoma cells are not unique 
in their possession of amplified EGF receptor genes. We report 
here that the EGF receptor gene in squamous cell carcinoma 
SCC- 15 is amplified four- to fivefold relative to normal epithelial 
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Figure 2. RNA blot analy- 
sis of polyA+ RNAs using 
the M P-labcled EGF recep- 
tor cDNA probe pE7. {A) 
and (B) are autoradiographs 
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Figure 3. DNA blot analysis of ////idlll-digested genomic DNAs 
using the pE7 probe (see Fig. 2). (A) and (B) arc autoradiograpbs 
from two separate gels. Sizes are in kilobase pairs (right). 10 of 
DNA was loaded except in (£), lane $ d-f, which represent serial 
dilutions of the 1623 DNA shown in lane c 



cells. The amplification of the EGF receptor gene may cause 
the initiation or maintenance of the malignant state in some 
human cells. 

Previously, we reported that a variety of transformed cell 
lines synthesize relatively high amounts of both EGF receptor 
protein and messenger RNA (10). It is conceivable that a 
moderate or even a small increase in the level of the EGF 
receptor leads to a change in the cellular phenotype, as has 
been demonstrated for the sre gene product (14). If this 
hypothesis is correct, then even a minor amplification of the 
EGF receptor gene copy number could contribute to the onset 
of tumorigenesis. Hendler and Ozanne (15) have examined 
lung squamous cell carcinomas and found that they contain a 
2.5-5-fold increase in EGF receptor levels. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and invasive Human Transitional 
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Gain and loss of chromosomal material is characterfetlo 
of bladder cancer, 83 weti a$ roaHgnant tramf ormafion In 
general. The consequences of these changes at both the 
transcription and translation levels Is at present unknown 
partly because of technical Imitations. Here we have at- 
tempted to address this question In pairs of noninvasive 
and Invasive human bladder tumors using a combination 
of technology that Included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels 15600 genes), and high resolution 
two-dimensional gel electrophoresis* the results showed 
that there is a gene dosage effect flat In some cases 
superimposes on other regulatory mechanisms* This ef- . 
feet depended (p < 0.015* on the magnitude of the com- 
parative genomic hybridization change. In general (IS of 
23 cases), chromosomal areas with more than 2-fold gain 
of OKA showed a corresponding increase In mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript leveK Be- 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations Id relatively few oases of well focused 
abundant proteins. $£tth few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The Implications, as wefl as limitations, 
of the. approach are discussed. Molecular 6 Cellular 
Proteomlcs 1&7-45, 2002. 

■fi ■ ■ * . 

, Aneuptoidy Is a common feature of most human cancers 
(1). but little is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation Jevels* 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression In highly amplified 
areas (2), and studies of Individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels In the case of c-erb-82, cyclfn d1 t ' 
ems1 t and N-myc (3-5). However, a high cyclln 01 protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number In- 
crease was observed without concomitant c-myo protein 
overexpresslon (6). . 

In human bladder tumors, karyotyping, fluorescent In situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of noninvasive pTa transitional cell carcinomas fJCCs), 
this Includes toss of chromosome 9 or parts of ft, as well as 
loss of Y in males. In minimally Invasive pTI TCCs, trie fc4- 
lowing alterations have been reported: 2q- ( 11p~, 1q+, 
11q13+, 17q+, ana* 20q+ (7-12). It has been suggested that 
these regions .harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas Involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult 

In this Investigation we-have combined genome-wkie tech- 
nology for detecting genomic gate and losses (CGH) with 
gene expression profiling techniques (rdcroarrays and pro- 
teomlcs) to determine the effect of gene copy number on 
transcript and protein levels In pairs of non-Invasive and In- 
vasive human bladder TCCs, 

EXPERIMENTAL PROCEDURES 

Materia/— Bladder tumor biopsies, ware sampled after informed 
consent was obtained and after removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 end 532 were 
staged by an experienced pathologist as pTa (superficial papillary), 

1 The abbreviations used are: CGH, comparative genomic hybrid- 
ization; TCC, transitional ceil carcinoma; LOK lose of heterozygosity; 
PA-FABP. psoriasis-associated tatty aofd-Wndlng protein; 2D, 
two-dimensional. 
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- eSlT^l^ZltS fT", fr ° m ,' eft » are GGH profile*, gene tocation and 
compared ^^S^^J^^^^^* ° n ? * h9 chramosome - A - «^ssi?n of mRNA In Invasive tumor 733 as 
coZ^TLis-^^ ^^T^ .T* B ' ex P rassion °» «"RNA in invasive tumor 827 compared with the non-tnvasfve 
^Z^bl^f^r^i^^ S '? nal rati ° ,umor DNA n0fmal DNA ,s shown al °«3 ^ length of the Xomo^me 

HaZ^ec^^SS ^T? 9 T °' fCmr chro ™ somes ^ is surrounded by thin curves ideating oneXSd 

lncrta^t*c« >^5SSSw^Sl!^.l gene in the ™aslve tumor compared with the noninvasive counted: >2-told 



grade I and II, respectively, tumors 733 and 827 were staged as pTI 
Onvasive Into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparaf/on-Tlssue biopsies, obtained fresh from surgery 
were embedded immediately In a sodium-guanldinlum thiocyanate 

pma °? Tl St0red at ~ 80 ° C ' Total RNA was ,so,ated using the 
RNAzol^ B RNA isolation method (WAK-Chemle Medical GMBH} 

SWwt^a^ ***** ^ " ° fl90(<n) 8G,eCt,0n St6P (0,, 9° teK 
CRAM Preparatfon-1 M g of mRNA was used as starting material. 
* ^?J* cond st rand cDNA synthesis was performed using the 
Superscripts choice system (Invltrogen) according to the manufac- 
turers Instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® tn wtro transcription Wt (Ambton). BlotMabeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs In the reaction 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy columns (CHagen). 

Array Hybridization and Scanning- Array hybridization and scan- 
ning was modified from a previous method (13). 10 ^ of cRNA was 
fragmented at 94 »C for 35 min in buffer containing 40 mM Tris 
acetate, pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization 
the fragmented cRNA In a 6x SSPE-T hybridization buffer (1 m NaCI* 
10 mM Tris, pH 7.6, 0.005% Triton), was heated to 95 *G for 5 min* 
subsequently cooled to 40 and loaded onto the Affymetrix probe 
array cartridge. The probe array was then incubated for 16 hat 40 °0 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes In 6X SSPE^T at 25 ft C followed by 4 washes In 0£x SSPE^T 
at 50 ♦C. The biollnylated cRNA was, stained with a strsptavtoTn- 
phycoerythrln conjugate, 10 *ig/ml (Molecular Probes) In 6x SSPE-T 
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Fig. 1— -continued 



for 30 nun at 25 *C followed by 1 0 washes in 6x SSPE-T at 25 *C. The 
probe arrays were scanned at 560 nm using a confocal laser scanning 
microscope (made for Affymetrix by Hewlett-Packard). The readings 
from the quantitative scanning were analyzed by Affymetrix gene 
expression analysis software. 

Micrvsatetiito Analysis -MfcrosateUite Analysts was performed as 
described previously (14). MicrosateUites were selected by use of 
wwwjiobLnlm.nih.gov/genemap98, and primer sequences were ob- 
tained from me genome data base at www.gdb.org. DNA was extracted 
from tumor and Wood and amplified by PCR In a volume of 20 p| for 35 
cycles. The ampflcons were denatured and electrophoresed for 3 h in an 
ABI Prism 377. Data were collected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor amplicons compared with blood. 

Proteomic Anafysis-TCCs were minced Into small pieces and 
homogenized in a small glass homogenizer in 0.5 ml of lysis solution. 
Samples were stored at -20 °C until use. The procedure for 2D gel 
electrophoresis has been described in detail elsewhere (15, 16). Gels 
were stained with silver nitrate and/or Coorhassie BriQIant Blue, Pro- 
teins were Identified by a combination of procedures that included 
microsequencing, mass spectrometry, two-dimensional gel Western 
immunoblotting, and comparison with the master two-dimensional gel 
Image of human keratinocyte proteins; see biobase.dk/cgi-btn/ceQs. 

CGH- Hybridization of differentially labeled tumor and normal DNA 
to normal metaphase chromosomes was performed as described 
previously (10). Ruorescelrv-labeled tumor DNA €200 ng), Texas Red- 



labeled reference DNA (200 ng). and human Cot-t DNA (20 /xg) were 
denatured at 37 °C for 5 mln and applied to denatured normal met- 
aphase slides. Hybridization was at 37 "C for 2 days. After washing, 
the slides were counterstained with 0.15 jtg/mJ 4 r 6-diamidirvo-2-phe^ 
nyfindote in an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using fluoresceln-labeted reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) to confirm the 
aberrations detected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red-labeled normal DNA. Digital Image analysis was 
used to identify chromosomal regions with abnormal fluorescence 
ratios, indicating; regions of DNA gains and losses. The average 
green:red fluorescence intensity ratio profiles were calculated using 
four Images of each chromosome (eight chromosomes total) with 
normalization of the green:red fluorescence intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4.6-dlamidino-2*phenyUndole band- 
ing patterns. Only images showing uniform high Intensity fluores- 
cence with minimal background staining were analyzed. Ail 
centromeres, p arms of acrocentric chromosomes, and rwterochro- 
matic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization-lbe CGH analysis 
identified a number, of chromosomal gains and losses In the 
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Table I 

Correlation between alterations detected by CGH and by expression monitoring 

Top, CGH used as Independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable (if expression alteration - what CGH deviation was found). 



CGH alterations 



Tumor 733 vs. 335 
Expression change clusters 



Concordance 



CGH alterations 



Tumor 827 vs. S32 
Expression change clusters 



Concordance 



13 Gain 



10 Loss 



10 Up-regulation 

0 Down-regulation 

3 No change 

1 Up-regulation 

5 Down-regulation 

4 No change 



77% 



50% 



10 Gain 
12 Loss 



8 Up-regulatlon 
0 Oown-regulation 

2 No change 

3 Up-regulation 

2 Down regulation 
7 No change 



Expression change clusters 



Tumor 733 vs. 335 
CGH alterations 



Concordance Expression change clusters 



Tumor 827 vs. 532 
CGH alterations 



80% 
17% 

Concordance 



16 Up-regulation 
21 Down-regulation 
15 No change 



11 Gain 

2 Loss 

3 No change 
1 Gain 

8 Loss 

12 No change 
3 Gain 

3 Loss 

9 No change 



69% 
38% 
60% 



17 Up-regulation 
9 Down^reguiation 
21 No change 



10 Gain 

5 Loss 

2 No change 

0 Gain 

3 Loss 

6 No change 

1 Gain 
3 Loss 

17 No change 



59% 
33% 
81% 



two Invasive tumors {staige pT1, TCCs 733 and 827), whereas 
the two non-invasive papillomas (stage pTa, TCCs 335 and 
532) showed only 9p-, 9q22-q33-. and X-, and 7+, 9q-, 
and Y-, respectively. Both invasive tumors showed changes 
(1q22-24+, 2q14.1-qter-, 3q12-q13.3-, 6q12-q22-, 
9q34+, 11q12-q13+, 17+, and 20q11.2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown In Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chfomosomal 
content, were observed at 1q23 In TCC 733 (Fig. 1A) and 
20q12 in TCC 827 (Fig. 1B). 

mRNA Expression in Relation to DNA Copy Number -The 
mRNA levels from the two Invasive tumors (TCCs 827 and 
733) were compared with the two non-Invasive counterparts 
(TCCs 532 and 335). This was done In two separate experi- 
ments In which we compared TCCs 733 to 335. and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scajmg as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way It was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the Invasive versus the non-invasive counterpart Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fold were regarded as Informative (Rg. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations In these 
chromosomal areas. In general, areas with a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, froth chromo- 
somes 1q21-q25, 2p arid 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels In the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an Increased level of transcripts In both 
TCCs 733 (77%) and 827 (80%) (Table l f top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often, regis- 
tered as having unaltered RNA levels (Table I, fop). The inabH^ 
Ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2-fold as the Independent variable and es- 
timated the frequency of CGH alterations In these areas. As 
above, we found that Increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected In areas with unaltered CGH ratios (Table I. bottom). 
Furthermore, as a control we looked at areas with no alter- 
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A 
AAA 



AAA. 
U 



,1 



Expression changes 
detected 



Expression changes 
not detected 



Expression changes 
detected 



Expression changes 
not detected 



Tumor 827 versus 532 Tumor 733 versus 335 

Fia2 Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide arrav 
moratory. 1^ aberration is shown as a numerical -fold change in ratio between invasive turners 827 (A) ar>d733(4) K 
counterparts 532 and 335. The expression change was taken from the Expression line to the rigM n r£ l.^cn^S ITrS^ 
e^onchange for a given chmmosomai region. At least half of the mRNAs from a given regkxfhave to be e^u^o^r^rS^ 
rwSded^ *™»^ — ^ which the CGH ratio plus or minus one standard 



ation in expression. No alteration was detected by CGH In 
most of these areas (TCC 733, 60% and TCC 827, 61%; see 
Table I, bottom). Because the ability to observe reduced or 
Increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations In the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2)v£or both tumors TCC 733 (p < 0.01 5) and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the DNA copy number) and alterations detected by the array 
based technology {Fig. 2^ Similar data were obtained when 
areas with altered expression were used as Independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction In expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
much larger extent. 

Microsatettite-based Detection of Minor Areas of Loss- 
es- In TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1, TCC 733 
chromosome 1q32, 2p21, and 7q21 and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two rnlcrosatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Ffg. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1 A). As Indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
Implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D11S1760, D11S922) 
positioned close to MUC2, IGF2, and cathepsin D Indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, I2p12.2, 12q21/l, and 16q24 
and In TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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Fw. 3. Microsateilite analysis of loss of heterozygosity. Tumor 
733 showing toss of heterozygosity at chromosome 1q25. detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Rg. 1), (b) by D1SZ735 close to cathepstn E (gene 
number 41 in Rg. 1), and (c) at chromosome 2p23 by D2S2251 close 
to general ^-spectrin (gene number 1 1 on Rg. 1 ) and of (d) tumor 827 
showing toss of heterozygosity at chromosome 18q12 by S18S11 18 
close to mitochondrial 3-oxoacyl-coenzyme A thiolase (gene number 
12 in Fig. 1). The upper curves show the electropherogram obtained 
from normal DNA from leukocytes (A/), and the lower curves show the 
electropherogram from tumor DNA (7). In alt cases one allele is 
partially lost in the tumor ampllcon. 

showing reduced mRNA transcripts. Only tho microsatetJite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes In the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels- 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all four tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating In areas away from the edges of the pH 



H AAAAA O J 



*8- 



o 

4U- 



•JC 

It 
V 



C D 



*A4 ♦ 



Reduced 
protein 



Unaltered 
protein 



Increased 
protein 



Fia 4. Correlation between protein levels as judged by ZD- 
PAGE and transcript ratio. For comparison proteins were dMdecf In 
three groups, unaltered In level or up- or down-regulated horizontal 
axis). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene {vertical axis). A, mRNAs mat were scored as 
present in both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the invasive tumors (along horizontal axis) or 
as absent in non-invasive reference (fop of figure). Two different 
scalings were used to exclude scaling as a confounder, TCCs 827 
and 532 (AA) were scaled with background suppression, and TCCs 
733 and 335 (#0) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
fe/r), phosphoglucomutase 1 . glutathione transferase class m number 
4, fatty acid-binding protein homologue, cytokeratln 15 # and cyto- 
keratjn 13; S (from (eft), fatty acid-binding protein homologue, 28-kDa 
heat shock protein, cytokeratln 1.3, and carcyclin; C(from /eft), <r-eno- 
lase, hnRNP 81, 28-kDa heat shock protein, 14-3-3-^ and 
pre-mRNA splicing factor; D, mesothelial keratin K7 (type II); B (from 
fop), glutathione S-transferase-tr and mesotheflai keratin K7 (type II); 
F(from fop and left), adenytyl cyclase-associated protein. E-cadherin, 
keratin 19, calglzzarin, phosphoglycerate mutase, annexm IV, oy- 
toskeletal y-actln. hnRNP At, integral membrane protein calnexJn 
(IP90). hnRNP H. brain-type ciathfin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translationatly controlled tumor protein, liver gtycetatdehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na*K- 
ATPase 0-1 subunit; G. (from top and left), TCP20, calgizzann, 70- 
kDa heat shock protein, calnexin, hnRNP H. cytokeratln 15, ATP 
synthase, keratin 19, triosepbosphate Isomerase, hnRNP F. liver glyc- 
eralaehyde-3-phosphatase dehydrogenase, glutathione S-transfer- 
ase-tr, and keratin 8; H (from fen), plasma gelsbiin, autoanOgen cal- 
reticulin, thioredoxin, and NAD+-dependent 15 hydroxyprostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxylase /J-subunlt, cyto- 
keratln 20, cytokeratln 17, prohibition, and fructose 1,6-Wphos- 
phatase; J annexin II; K, annexfn IV; L (from top and fe/r}. 90-kDa heat 
shock protein, prolyl 4-hydroxylase 0-subunrt, a-enolase. GRP 78, 
cyclophilin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Rg. 
4). Only one gene showed disagreement between transcript 
aiteration and protein alteration. Except for a group of cyto- 
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keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected In TCCs 733 and 335, and of these 1 9 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed In the non-invasive TCC 335 but lost In the Invasive 
counterpart (TCC 733; see Fig. 5). The smaller number of 
proteins detected In both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes In transcript levels also 
showed corresponding changes In the protein level (Table II). 
These regions Included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20. annexlns It and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area In invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence as to the effect of these gains 
and losses on gene expression in two pairs of noninvasive 
and invasive TCCs using high throughput expression arrays 
and proteomics. In combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited Increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion^ The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 

Table II 



Proteins whose expression level correlates with both mRNA and gene dose changes 


Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcript alteration* 


Protein alteration 


Annexin II 


1q21 


733 


Gain 


Abs to Pres* 


Increase 


Annexin IV 


2p13 


733 


Gain 


3.9-Fold up 


Increase 


Cytokeratln17 


17q12-q21 


827 


Gain 


3.8-Fold up 


Increase 


Cylokeratin 20 


17q2t,1 


827 


Gain 


5.6-Fold up 


Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


10-Fold down 


Decrease 


FBP1 


9q22 


827 


Gain 


2.3-Fold up 


Increase 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


15q12-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Profubftin 


17q21 


827/733 


Gain 


37-/2.5-Fold up* 


Increase 


ProryM-hydroxyl 


17q25 


827/733 


Gain 


5.7-/1 .6-Fold up 


Increase 


hnRNPBI 


7p1S 


827 


Loss 


2.5-Fold down 


Decrease 



* Abs, absent; Pres. present. 

* In cases where the corresponding alterations were found In both TCCs 827 and 733 these are shown as 827/733. 




Fkl 5. Comparison of protein and transcript levels In invasive 
and non-Invasive TCCs, The upper part of the figure shows a 2D gel 
Heft) and the oligonucleotide array Iffght) of TCC 532. The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gels of TCCs 532 and 827 are shown below. 
Clearly, cytokeratins 13 and 15 are strongly down-regulated in TCC 
827 (red annotation). The tile on the array containing probes for 
cytokeratin 15 is enlarged below the array (red arrow) from TCC 532 
and is compared with TCC 827. The upper row of squares in each tile 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation (used for correction for un- 
spectfic binding). Absence of signal Is depicted as black, and the 
higher the signal the fighter the color. A high transcript level was 
detected in TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13. a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the bottom of the figure QefQ show levels of PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (Invasive), respectively. Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript A medium transcript level was de- 
tected in the case of TCC 335 (1 277 units) whereas very low levels 
were detected In TCC 733 (166 units). IEF. isoelectric focusing. 
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ever, an increase or decrease In DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively. Some of these transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors (e.g. in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome 1q21) from absent to 2670 
arbitrary units; in TCC 827 transcript from small proline-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the loss of controlled 
methylaUon in tumor cells (17-19). Thus, it may be possible . 
that in chromosomes with increased DNA copy numbers two 
or more alleles could be demethylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but In this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these Include 9p-, 9q-, 1q+, Y- 
(2, 6), and in pT1 tumors, 2q-,11p-. 11q-, 1q+, 5p+, 8q+, 
17q+. and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and Y— , respectively, likewise, the two minimal Invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the latter 
often linked to 17 q+ (both tumors), and 1q+ and 9p~, often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general Importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal Instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with Increased copy numbers. Analysis of these regions 
by positioning heterozygous mlcrosateiiites as dose as pos- 
sible to the locus showing reduced gene expression revealed 
loss of heterozygosity in several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA mlcroarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations- At present we do not know the mechanism be- 
hind chromosomal aneuploldy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles, A mechanism as genetic imprinting has 
an Impact on the expression level In normal cells and is often 
reduced In tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close, and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression Is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and UDH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA. and protein levels 
showed a striking correspondence although In some cases 
we found discrepancies that may be attributed to translational 
regulation, post-transtationa! processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translationaliy inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very Important in the case of polypeptides with a short 
half-life (e.g. signaling proteins), A poor correlation between 
mRNA and protein levels was found in fiver cells as deter- 
mined by arrays and 20-PAGE (25), and a moderate correla- 
tion was recently reported by Ideker et ai. (26) In yeast 

interestingly, our study revealed a much better correlation 
between gained chromosomal areas and Increased mRNA 
levels than between loss of chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript. One possible 
explanation could be that by losing one allele the change In 
mRNA level Is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold Increase In gene copy number resulting In a much 
higher impact on transcript level, the latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2 -fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may In the future allow detection of lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area In chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated Is the large 
extent of protein modification that occurs after translation, 
requiring ImmunoidentificatJon and/or mass spectrometry to 
correctly Identify the proteins In the gels. 

In conclusion, the results presented In this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microanrays with many thousand radiation 
hybrki-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translationai level, a clearer and more developed un- 
derstanding of the tumor genome will be forthcoming. 
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ABSTRACT* 



e changes underlie tumor progression and nay lead to cancer- 
xpresslon of critical genes. Over 110Q publications have de- 



Genetic i 

specific expression . ... „. w „„_. 

; scribed the use of comparative genomic hybridization (CGH) to analyze 
tht partem of copy number alterations In cancer, bat very few o/thegene* 
affected are known. Here, we performed bJgh-resolotion CGH analysis on - 
cDNA mkroarrayi In breast cancer and directly compared copy number 
and mRNA expression levels of 15324 .genes to qaantitate the impact of 
genomic changes on gene expression, We Identified and mapped the 
boundaries of 24 independent ampBeons, ranging In she from (U to 12 
Mb, Throaghont the genome, bom high* and low-level copy number 
changes had a substantial Impact on gene expression, with '44% of the. 
highly amplified genes showing overexpression and. 103% of the highly 
ovtrexprcssed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genca whose expression levels across 14 
samples were systematically attributable to gene amplification.. T>e»e 
in eroded most previously described amplified genea in breast cancer and ' 
many novel targets for genomic' alterations, including the BOXB7 gene, 
the presence of which In a novel amplicon at 17o21l3 was validated in 
10.2K of primary breast cancers and associated with poor patient pros> 
nods. In conclusion, CGH on cDNA microairays revealed hundreds or. 
novel genes whose overexpression U attribntable to gene amplification. 
Theie genes may provide Insights to the clonal- evomtion and progression 
.of breast cancer and highlight promising therapeutic targets, 

INTROBUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries some of which may exrdain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the. molecular 
mechanisms underlying gene expression patterns in cancer have re* 
mamed^efosive, and the utility of gene expression profiling m the 
id^tification of specific therapeutic targets remains JunffiE* 1 ' ** ' 
- Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate, (he effects 
of genetic changes may be, hnportant by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets, for' anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as &RBB2 andBGFJR (7, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, oyer 
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Fig. J.In^aaorgctwcopyipmbc-cn'glotaJ g«»eKprwboItv«U.^ pc^«nt> 4 ocf 
°w mod uodcrexprasted=geaai (T tad*) according to copy mxnbqWos (T arfr). 
Threehold valves used for Over- M d lAidorexptrcsifon woo >Z184 (global upper lYt of 
the cDNA ratios) end <QMU (globe) lower 7H of (be expression ratios). B, 
of amplified and deleted genes according to expression alios. TbeshoVd 
amplification end deletion were >\JS end «L7. ' ' 
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20. recurrent regions of DNA ampiification have been mapped in 
breast cancer by CGH 5 . (9, 10).. However, these ampiicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes, that, are attributable to underlying gene cop/ 
'Support FoundiUoi, the Finnish number iterations highlight transcripts that are. actively in- 

ocer Society,: the Pixtanoaa Cancer Society, the Ptrkanznaa Cultural Fwradsdon, the VOlved m mc causation or maintenance of,the malignant phenotype. 
isiss ^cart Cancer Oroun, Uc Foundation for the Development of laboratory mb*v To identify such transcripts, we applied a ccmbmaition ofcDNA and 

CGH microarrays to: (a) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes, whose mRNA expro 
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sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND METHODS 

Breast Cancer Cell Lines. Fourteen breast cancer cell lines (BT-20 BT- 
474 t HCC1428, Hs578t, MCF7, MDA-361. MDA-436, MDA-453. MDA-468 
SKBR-3, T-470, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and mRNA were isolated 
using standard protocols. 

Copy Number and Expression Analyses by cDNA Mkroarrays. The 
preparation and printing of die 13,824 cDNA clones on glass slides were 
farmed as described (1 1-13). Of these clones, 244 represented uncharac- 
terized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14 
15). Briefly, 20 /ig of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with AJul and Rsal (Life Technol- 
ogies, Inc^ Rockville. MD) and purified by phenol/chloroform extraction. Six 
Jig of digested cell line DNAs were labeled with Cy3-dUTP (Amcrsham 
Pharmacia) and normal DNA with CyS-dUTP (Amcrsham Pharmacia) using 
the Btopnme Ubeling kit (Life Technologies, Inc.). Hybridization (14, 15) and 
posthybndization washes (13) were done as described For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagcne, 
La Jolla, CA) was used in all experiment*. Forty M g of reference RNA were 
labeled with Cy3-dUTP and 3.5 M g of test mRNA with Cy5-dUTP, and the 
labeled cDNAs were hybridized on microarrays as described (13,15) For both 
microarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto, CA) was used to measure the fluorescence intensities at the target 
locations using the DEARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distnbuuon of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spotted four times onto the 
array. Low quality measurements (i.e.. copy number data with mean reference 
intensity <100 fluorescent units, and expression data with both test and 
reference mtemtty <100 fluorescent units and/or with spot size <50 units) 



were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define cutpotnts for increased/ 
decreased copy number. Genes with CGH ratio > 1,43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5V») were considered to be 
deleted. 

Statistical Analysb of CGH and cDN A Microarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled I for amplification (ratio, > J. 43)' and 0 for no amplification. 
Amplification was correlated with gene expression using the signal-to-noise 
statistics (I). We calculated a weight, w r for each gene as followa: 

where m glt <r Bl and cr^ denote the means aod SDs for the expression 
levels for amplified and nonamplified cell lines, respectively. To assess the 
statistical significance of each weight, we performed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation man the original Weight was denoted by a. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and AropUcou Mapping. Each 
cDNA clone on the microarray was assigned to a Unigene cluster using the 
Unigene Build 141. 6 A database of genomic sequence alignment inrormaUon 
for mRNA sequences was created froro the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoidcnPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons were de6ned as a CGH copy number ratio >2.0 in at kast two 
adjacent clones in two or more cell lines or a CGH ratio >2,0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 
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Table I Summary qf independent ampHcot* tn 14 brtast can 
CGH microarray 
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CGH were validated, with lq2.1, 17qi2-q2L2, 17q22-q23, 20q.i3.i, 
and 20ql3.2 regions being most commonly amplified. Furtherx-aore,' 
me boundaries of these amplicons were precisely delineated. Iq ad- 
dition, novel amplicons were identified at 9pl3 (38.65-39.25 3Mb) 
and 17q2U (52.47-55.80 Mb). . ' 

Direct Identification Of Putative Amplification Target G^nes. 
The cDNA/CGH microarray technique enables the direct cojtrela- 
tion of copy number and expression data on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolxuioo 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at tpl3, 17q22~q23, and 20ql3 were higbty ovcrex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR as the most highly overexpressed and amplified 
gene at 7pl i-pl2 (Fig. 34). In BT-474, the two known ampl icons 
at 17ql2 and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig. 3fl). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB7, were highly amplified fa a 
previously undescribed independent amplicon at 17q21.3. HOJCB7 
was systematically amplified (as validated by FISH, Fig. 30, inset) 
as well as overexpressed (as verified by RT-PCR* data not shown) 
in BT-474, UACC812, and ZR-75-30 cells. Furthermore, this novel 



extended to include neighboring nonamplificd clones (ratio, < 1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISR to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RPl 1-36 IKS was la- 
beled with SpcctrumOraoge (Vysta, Downers Grove, 1L), and "Spedrum- 
Orange-labeled probe for EGFR was obtained from Vysis, SpectrurnGreen- 
labeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference, A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
(1 SX The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the MH. Specimens containing a 2-fold or higher 
increase in the number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed, using 
Access RT-PCR System (Promega Corp., Madison, Wt) with 1 0 ng of mRNA 
as a template. HOXB7 primers were 5'-GAGCAGAGGGACTCGGACTT-3' 
and 5 ' -GCGTCAGGT AGCG ATTGT AG-3 ' . 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH raicroarrays) in 14 breast cancer cell 
lines, the results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (i.e.. belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. I A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. Iff). Low-level copy number 
increases and decreases were also associated with similar, although 
less dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1 ,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig, A). The average spacing of clones throughout the genome 
was 267 kb. This highresohition mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
1). Several amplification sites detected previously by chromosomal 





Fig. 3. Annotation of gene expression data on CGH microarray profile*. A, genes in the 
7pU-pl2 ifflplicoo in the MDA-468 cell Hoc ore highly expressed {red dots) and include 
the EGFR oncogene. B, ccveral genes in the I7ql2, 17q2l.3, tnd I7cj23 amplicons to the 
BT-474 breast cancer cell line arc highly overexpressed (red) and include the HOXB7 
gene. The data labels and color coding arc AS Indicated for Fig. 2C Insets show 
chronosomal CGH profiles for (he corresponding chromosomes and validation of the 
increased copy number by interphase FISH using EGFR (red) and chromosome 7 
centromere probe (green) to MDA-468 {A) and r70AB7-specific probe {red) and chro- 
mosome 17 centromere (green) to BT-474 cells (S). 
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Fig. 4. List of SO genes with a statistically 
signlftcani correUtloa (a value <0.O5) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value for each 
gene are indicated- The gene* have been ordered 
according to (heir position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 ceil lines. The 
key to the color code is shown at the bottom of the 
graph. Groy squares, missing values. The complete 
list of 270 genes is shown in supplemental Fig. B. 
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amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = (KOOi). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across all 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data, 8 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



1 Internet address; httpyAvww.geneontology.or^. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in >i000 publications applying CGH* (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 
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level copy number increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
wgions affected, but these effects were more subtle on a gene-by-gcne 
basis (ban those of high-level amplifications. However, the impact of 
tow-level gains on the deregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of genetic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns io yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH raicroarray analysis identified 24 independent breast 
cancer ampl icons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the bomeobox gene region at I7q2l.3 and led to die over- 
expression of the HOXB7 and HOXB2 genes. The homeodomain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogenesis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for overexpressing HOXB7 in breast cancer and suggest that 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phcnotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing -2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, MYC, 
EGFA, ribosomal protein s6 kinase, and AJB3, but also numerous 
novel genes such as hi RAS-r elated gene (lpl3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of tie 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we demonstrate application of cDNA raicroarrays 
to the analysts of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (6) a high-resolution map of 24 independent 
amplicons in breast cancer; and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q2i.3 implicated amplification and overexpression of the 
HOXB7 gene, in breast cancer, including a clinical association 



between HOXB7 amplification and ppor patient prognosis. Or^xg^ 
our results illustrate how the identification of genes activate^ by 
gene amplification provides a powerful approach to higruiight 
genes with an important role in cancer as well as to prioritizes and 
validate putative targets for therapy development. 
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Microarray analysis reveals a major direct role of 
DNA copy number alteration in the transcriptional 
program of human breast tumors 

Comprehensive Cancer Center, Un^.ty^ ™* <* <^etlcs and Uneherger 

Contributed by Patrick O. Brown. August 6. 2002 



Genomic DNA copy number alterations are key genetic events in 
the development and progression of human cancers. Here we 
SST V eno ^ ide mlcroaTr *y comparative genomic hybrid- 
ation (array CGH) analysis of ONA copy number variation In 
a series of primary human breast tumors. We have profiled ONA 
copy number alteration across 6.691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer cell lines. While the overall patterns of DMA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
«oluuor i (gene-by^gene) mapping of amplicon boundaries and 
hZ^^ ^l^^n * ampHcan shape provide significant 
fmprovement In the localization of candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation In gene copy number contributes to 

thT£Z XlZ^^l" tUm ° r Celb ' we find 

that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number Influences gene ex- 
pression across a wide range of ONA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-fold change in DNA copy number is associated with a corre- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 
12% of aD the variation In gene expression among the breast 
2T2 ^^^"^We to underlying variation In gene copy 
number. These findings provide evidence that widespread DMA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1), have led to the identifi 
cation of a number of recurrent regions of DNA copy number 
alteration m breast cancer cell lines and tumors (2-4). While 
ff me £ "#? ns ^ known or candidate oncogenes 
ttl^ GFR J K } ^ C (8q24 >' ^NDl (Uql3), ERBB2 
Rm ^ a ?f^™i^ ql3 > ] and tun,or s WpresSor genes 
ffiSr T ? 3 ( }l ?m rclevant Sene(s) within 

other regions (e g., gam of lq, 8q22, and I7q22-24 t and loss of 
8p) remain to be identified. A high^rcsolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations m tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer in this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number altera* on 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
ELSE? a"™ 0 " aI l Cr of genes within involved 

regions. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA microarrajs, we had 
S» <0 «l*>5f OA a genomic scale the relationship 
between DNA copy number changes and gene expression; From 



this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the transcriptional 
programs of breast tumors. 

Materials and Methods 

Tumors and Cell lines. Primary breast tumors were predominantly 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50% being lymph node positive. Trie 
fraction of tumor cells within specimens averaged at least 50% 
Details of individual tumors have been published (8, 91, and" 
are summarized in Table 1, which is published as supporting 
information on the PNAS web site, www.pnas.org. Brealt cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/chloroform extraction 
followed by ethanol precipitation. 

ONA labeling and Microarray Hybridizations. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
in pollack etol. (7). with slight modifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accordingly. 'Test" DNA 
(from tumors and cell lines) was f Juorescenuy labeled (Cy5) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genes (i.e., UniGene dusters), the 
reference (labeled with Cy3) for each hybridization was nor- 
mal female leutocyte DNA from a single donor. The fabrication 
of cDNA microarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePor scanner (Axon Instruments, Foster City, CA), and 
fluorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbl.gov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with fluorescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles arc 
displayed as a moving average (symmetric 5-uearest neighbors). I 
Map positions for arrayed human cDNAs were assigned by 



Abbreviation: CGH, comparative genomic hybr hfizatton. 
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one of 6,691 dif ferentmappedhum™ T^^^^^^^^V^^ * *«erent cell line or tumor, and each columnTepresents 

Nearest neighbors) 

Amplification, green luminescence JZfcSSSff ^tatoS^T^ T* - * * at ■* '-ninescence reflects 
copy number pro^ao-^thexchro^ome,^ 



n ^? g th ° startin 8 Potion of the best and longest match of 
any UNA sequence represented in the corresponding UniGene 
cluster^lO) against the "Golden Path" genome assembly 
(http://genome.ucsc.edu/; Oct 7, 2000 Freeze). For UniGene 
clusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UniGene 
cluster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (Le., reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH oo 44 predominantly locally advanced, 

P EE ^ eaSl tUmors and 10 breast can <*r eel! lines, using 
cDNA microarrays containing 6,691 different mapped human 
genes (Fig. la; also see Materials and Methods for dctaUs of 
microarray hybridizations). To take full advantage of the im- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according to the "Golden 
mn (http://gcnomc.ucsc.edu/) genome assembly of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g. 
candidate oncogenes within amplicons), but also provide precise' 
genetic landmarksfor chromosomal regions of amplification and 
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deletion. Parallel anarysis of DNA from cell lines containing 
different numbers of X chromosomes (Fig. lb), as we did before 
(7), demonstrated the sensitivity of our method to detect single* 
copy loss (45. XO), and 15- (47,XXX), 2- (4&\XXXX), or 
2.5-fold (49,XXXXX) gains (also see Fig. 5, which is published 
as supporting information on the PNAS web site). Fluorescence 
ranos were linearly proportional to copy number ratios, which 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer ceU lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found in every cancer cell line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readily iden- 
tifiable. For example r gains within Iq, 8q, 17q, and 20q were 
«» 8 hi &h proportion of breast cancer cell lines/tumors 
(90%/69%, 100%/47%, 100%/60%, and 90%/44%, respective- 
werc ,0SS€S withln *P» 3 P. 8 P» ^ i3q (80%/24% 

with published cytogenetic studies (refc. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the PNAS web site). The total 
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ONAcopynumberalteratiCHiacfOss*^ 
of X chromosomes, for breast cancer con lines, and for breast tumors. Breast cancer cell line* and tumors are separately ordered by hierarchical clustering to 
highlight recurrent copy number changes. The 241 genes present cm the microarrays and mapping to chromosome 8 are ordered by position along the 
chromosome. Fluorescence ratios (test/reference) are depicted by a log, pseudocolor scale (indicated). Selected genes are Indicated with color-coded text (red 
increased; green, decreased; Mac*, no change; gray, not wet) measured) to reflect correspondingly altered mRNA levels (observed m the majority of the subset 
of samples displaying the DNA copy number change). The map positions for genes of Interest that are not represented on the mlcroarray are indicated in the 
row above those genes represented on the array. (6) graphical display of DMA copy number profile for breast cancer cell line 5K8R3. Fluorescence ratios 
(tumor/normal) are plotted on a log, scale for chromosome 8 genes, ordered along the chromosome. 



number of genomic alterations (gains and tosses) was found to 
be significantly higher in breast tumors that were high grade (P = 
0.008), consistent with published CGH data (3), estrogen recep- 
tor negative {P « 0.04), and harboring TP53 mutations (P = 
0.0006) (see Table 4, which is published as supporting informa- 
tion on the PNAS web site). 

The improved spatial resolution of our array CGH analysis is 
illustrated tor chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. la). The complexity of amplicdn 
structure is most easUy appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8q in SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled 1-3 in Fig. 2b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as well as the 
recurrently amplified regions on chromosomes 17 and 20* can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS web site). 

For a subset of breast cancer cell lines and tumors (4 and 37, 
respectively), and a subset of arrayed genes (6,095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
microarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor cells. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 
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of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). The overall patterns of gene amplification and 
elevated gene expression are quite concordant; i.e., a significant 
traction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplE 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifier 
tions (fluorescence ratios >4/ and representing 91 different 
genes) 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean -centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amp iflcation (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 ccU lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low- 
medjum-. and high-level amplification (Fig. 4a). For both the 
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breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pair-wise Student's 
t tests comparing adjacent classes: ceil lines, 4 x 10^, 1 x 10~ 49 
5 X 10 j 5 , 1 X 10-*; tumors, 1 X lO^ 43 , 1 X 10-»* 5 X 10~< l ! 
1 x 10 <). A linear regression of the average log(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1,4- and 1.5-fold changes in mRNA 
level for the breast cancer cell lines and tumors, respectively ( Fig. 
4a, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 4b). 
The distribution of correlations forms a normal-shaped curve 
but with the peak markedly shifted in the positive direction from 
zero. This shift is statistically significant, as evidenced In a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest correlations between DNA 
copy number and mRNA level (the right tail of the distribution 
in Fig. 4b) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
7% of all of the observed variation in raRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. 4d). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence Intensity/ 
background >3); using that data, our estimate of the percent 
variation in raRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4$. This stilt undoubtedly 
represents a significant underestimate, as the observed variation 
m global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array COH analysis of DNA copy number 
alteration in a series of human breast tumors demonstrates the 
usefulness of defining amplicon boundaries at high resolution 
(gene-by gene), and quantitatively measuring amplicon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 
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cell lines and tumors. Although the DNA microanays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely underestimating the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generalizable 
(but would nevertheless still be remarkable if only applicable to 
this set of -6,100 genes), vy 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene expression in cancer cells. In 
agreement with our findings, Phiffips et aL (14) have shown that 
with the acquisition of tumorigenicity in an immortalized pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a staUstically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et al. (15) recently reported that in metastatic 
colon tumors only -4% of genes within amplified regions were 
found more highly (>2-fold) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Plateer et al (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unamplified chromosomal regions were found to 
exhibit at least 4-foid altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal CGH may have resulted in poorly delimiting the bound- 
aries of high-complexity arapiicons, effectively overcalling re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in rumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampH- 
con boundaries and shape (to identity the "driving*' gene(s) 
within amplicons (16)), on a targe number of samples, In addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution olf expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration* particularly aneuploidy (where 
gene expression can be averaged across large chromosomal 
regions; see Fig. 3 and supporting information). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients' tumors may be traceable to underly- 
ing variation in DNA copy number. Sixth, this finding supports 
a possible role for widespread DNA copy number alteration in 
turnorigcnesis,(17, 18), beyond the amplification of specific 
oncogenes. and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stocMornetric relationships in cell metabolism and physiology 
(e.g., proteosome, mitotic spindle), possibly promoting further 
chromosomal instability and directly contributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression in cancer. 
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Abstract 

Background: Colorectal cancer is a common cancer all over the world. Aberrations in the cell 
cycle checkpoints have been shown to be of prognostic significance in colorectal cancer. 

Methods: The expression of cyclin D I , cyclin A, histone H3 and KJ-67 was examined in 60 colorectal 
cancer cases for co-regulation and impact on overall survival using immunohistochemistry, 
southern blot and in situ hybridization techniques. Immunoreactivity was evaluated semi 
quantitatively by determining the staining index of the studied proteins. 

Results: There was a significant correlation between cyclin Dl gene amplification and protein 
overexpression (concordance = 63.6%) and between Ki-67 and the other studied proteins. The 
staining index for Ki-67, cyclin A and Dl was higher In large, poorly differentiated tumors. The 
staining index of cyclin D I was significantly higher in cases with deeply invasive tumors and nodal 
metastasis. Overexpression of cyclin A and Dl and amplification of cyclin Dl were associated with 
reduced overall survival. Multivariate analysis shows that cyclin Dl and A are two independent 
prognostic factors in colorectal cancer patients. 

Conclusions: Loss of cell cycle checkpoints control is common in colorectal cancer. Cyclin A and 
D I are superior independent indicators of poor prognosis in colorectal cancer patients. Therefore, 
they may help in predicting the clinical outcome of those patients on an individual basis and could 
be considered important therapeutic targets. 



Background 

Colorectal cancer (CRC) is the third most common cancer 
in Western countries In Egypt, CRC has unique char- 



acteristics that differ from that reported in other countries 
of the western society. It was estimated that 35.6% of the 
Egyptian CRC cases are below 40 years of age and patients 
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usually present with advanced stage, high grade tumors 
that carry more mutations (2). This uniquely high propor- 
tion of early-onset CRC, the early and continuous expo- 
sure to hazardous environmental agents, the different 
mutational spectrum and the prevalent consanguinity in 
Egypt justify further studies |3). It was proved that most 
cancers result from accumulation of genetic alterations 
involving certain groups of genes, the majority of which 
are cell cycle regulators that either stimulate or inhibit cell 
cycle progression [lj. Cell proliferation allows orderly 
progression through the cell cycle, which is governed by a 
number of proteins including cyclins and cyclin dependent 
kinases (4,5]. The cyclins belong to a superfamily of genes 
whose products complex with various cyc/m-dependent 
kinases (cdks) to regulate transitions through key check- 
points of the cell cycle [6J. Abnormalities of several cyclins 
have been reported in different tumor types, implicating, 
in particular, cyclin A, cyclin E and cyclin D [6,7|. 

Cyclin Dl is a Gl cyclin that regulates the transition from 
Gl to S phase since its peak level and maximum activity 
are reached during the Gl phase of the cell cycle. Whereas 



cyclin A is regarded a regulator of the transition to mitosis 
since it reaches its maximum level during the S and G2 
phases [8j. The mechanisms likely to activate the onco- 
genic properties of the cyclins include chromosomal trans- 
locations, gene amplification and aberrant protein 
overexpression [7,9]. 

Several studies have shown that, hislone H3 mRNA expres- 
sion can be used to identify the S phase fraction (SPF) 
through the in situ hybridization (1SH) technique 1 10,1 1|. 
The level of hislone H3 mRNA reaches its peak during the 
S phase and then drops rapidly at the G2 phase [ 1 2 j. 

In face of the increasing incidence of CRC and its peculiar 
pattern in the Egyptian population, the present study was 
conducted to assess the role of Ki-67 (pan-cell cycle 
marker), cyclin Dl (Gl phase marker), histone H3 mRNA 
(S phase marker), cyclin A (S to G2 phase marker) in CRC. 
The expression level of these markers was correlated to the 
clinicopathologic features and the overall survival of 
patients. 



Table I : Clinicopathological features of patients in relation to the staining index (SI) of Ki-67, cyclin D J, cyclin A, histone H3 

SI (mean + SO) 



Variables No. of cases Ki-67 Cyclin Dl Cyclin A Histone H3 



Sex 

Male 

Female 
Age (years) 

250 

<S0 

Tumor size (cm) 

<5.0 

25.0 
Hirtofogy 

Normal 

Carcinoma 

CI 

Gil 

GUI 
Lymph node 

Negative 

Positive 
Depth of invasion 

m, sm 

beyond sm 
Stage 

I 

II 
III 
IV 



36 
24 

41 

[9 

33 
27 

20 
60 
15 
21 
24 

33 
27 

17 
43 

6 
27 
12 
15 



18.0 ±6.4 

20.1 ±5.8 

1 1.7 ±6.0* 

23.8 ± 5.6 

12.2 ±6.3* 
30.1 ±6,2 

3.S ± 2.0* 

30.3 ± 6.2 
1 1.7 ±6.2 
I 1.8 ± 5.6 
30.0 ±4.3 

19.5 ±7.0 
21.3 ±4.9 

20.7 ± 6.7 

21.9 ±6.2 

20.6 ± 67 

20.8 ± 6.9 
22.0 ±5.4 

24.7 ±6.1 



6.7 ± 4.3 

8.8 ± 8.4 

5.6 ± 5.2 

7.7 ± 6.8 

5.3 ± 3.8* 

22.8 ±7.2 

0.6 ± 0.2* 

24.9 ± 6.3 

6.6 ± 4.0 

8.9 ± 3.6 
22.0 ±8. 1 

5.4 ± 5.3* 
20.6 ± 6.9 

3.1 ± 3.1* 
12.4 ±6.5 

5.7 ± 6.9 
5.3 ± 4.3 
7.7 ± 6.0 

1 1.3 ±9.6 



12.7 ±5.7 
10.0 ±6.0 

10.0 ±5.3 
13.6 ±57 

1 1.5 ±6.1* 

28.6 ± 5.6 

2.3 ± I.I* 

27.2 ± 5.8 
10.0 ±5.4 

12.3 ±6.5 
27.0 ± 4.9 

11.9 ±6.5 

12.5 ±5.0 

1 1.9 ±7.2 
12.2 ±5.6 

24.2 ± 6.9 

24.6 ± 6.0 
27.) ±5.2 
27.5 ± S.5 



10.7 ±5.3 
10.7 ±5.4 

6.0 ± 5.0* 
2X0 ± 5.2 

10.3 ±4.9* 
24.0 ± 5.6 

2.2 ± 0.9 
107 ±5.3 

1 1.4 ±4.9 
7.8 ± 5.4 

1 1.5 ±5.4 

12.3 ±5.5 

14.2 ±5.0 

10.4 ±5.1 
107 ±5.4 

IU ±5.3 
10.4 ± 57 
10.4 ± 4.9 

12.3 ± 6.2 



* p. value < 0.05 (significant) 
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Methods 
Tissue samples 

Paraffin-embedded tumor tissues were obtained from 60 
CRC patients (47 colon and 13 rectal carcinomas) that 
were diagnosed and treated at the National Cancer Insti- 
tute, Cairo, Egypt during the period from January, 1997 to 
June, 2002. Clinicopathological data of the studied cases 
are illustrated in table 1 . None of the patients received any 
chemotherapy or irradiation prior to surgery. Histological 
diagnosis of all cases was done by 2 independent pathol- 
ogists according to the WHO Histological Classification. 
Tumors were staged according to the TNM staging system 
[13]. The depth of tumor invasion was classified as inva- 
sion of the mucosa including muscularis mucosa (m), 
invasion of the submucosa (sm), or invasion beyond the 
submucosa [8]. Normal colonic tissues were obtained 
from autopsy specimens (n = 20) and were used as a con- 
trol. The actual survival rate of the patients was calculated 
from the date of resection to the date of death. 

Immunohistoch emistry 

Four micron sections of each normal and tumor specimen 
were cut onto positive-charged slides; air dried overnight, 
de-paraffinized in xylene, hydrated through a series of 
graded alcohol and washed in distilled water and 0.01 
PBS (pH 7 A). Slides were then processed for IHC as 
described by Handa et al. [8]. using the following anti- 
bodies: Ki-67 (MIB-1, Dako), cyclin A (6E6; Novocastra, 
Newcastle-Upon-Tyne, UK) and cyclin Dl (DCS-6, Dako). 
A case of invasive breast cancer was used as a positive con- 
trol for Ki-67 and cyclin A whereas a case of mantle cell 
lymphoma was used as a control for cyclin Dl. Negative 
controls were obtained by replacing the primary antibody 
by non-immunized rabbit or mouse serum. 

Brown nuclear staining was regarded as a positive result 
for all studied markers. The proportion of positively- 
stained cells and the intensity of staining were scored in 
tumor and normal colorectal mucosal sections at medium 
power (x200).The degree of positive tumor staining (per- 
centage of positive tumor cells in the examined section) 
was scored from 1 -6 and the staining intensity was scored 
from 0-6 according to the pattern of staining in the exam- 
ined section. Staining index (SI) was calculated by multi- 
plying the cellularity and staining scores as described by 
Kinget al. [14). 

In situ hybridization 

All tumor samples and 5 normal controls were assessed 
for histone H3 mRNA by ISH using the commercially avail- 
able 550 base fluorescein-labeled DNA probe (Dako, 
Carpinteria, CA) as described by Nagao et al., 1996. This 
probe hybridizes to the whole mRNA transcript of the 
human histoneH3 gene including the5' and 3' untrans- 
lated regions. Scoring of histone H3 mRNA was performed 



as for immunohistochemistry, however, hybridization 
signals were detected in the cytoplasm. 

Molecular detection of cyclin Dl gene amplification 
High molecular weight DNA was extracted from paraffin- 
embedded tissues of the tumor and normal colorectal 
mucosal samples as previously described [15], The pro- 
portion of neoplastic and normal cells was determined in 
each tumor sample by examining hematoxylin and eosin- 
stained slides obtained from the edge of the specimen 
used for DNA extraction. Tumor samples were evaluated 
for amplification of cyclin Dl if more than 75% of the 
examined sections were formed of neoplastic cells. 
Accordingly, 50 cases were eligible for the analysis. Ten 
micrograms of the extracted DNA was digested with 
EcoRl. DNA from selected cases was also digested with 
Bg/II and HindM. Samples were separated on 0.8% agar- 
ose gels and transferred to Hybond-N membranes (Amer- 
sham Int., Amersham, UK). The membranes were 
hybridized with 50% formamide, 5 * SSC, 5 x Denhardt's, 
500 ug/ml denatured salmon sperm DNA, 10% dextran 
sulphate and 10 6 cpm/ml of* 2 P-labeled PRAD-1 probe for 
24 h. Membranes were washed with 2 x SSC, 0. 1% SDS at 
room temperature for 30 min followed by 2 x SSC, 0.1% 
SDS at 60°C for 30 min and 0.1 x SSC, 0. 1% SDS at 60 °C 
for 1 h. Filters were autoradiographed using an intensify- 
ing screen at -70 °C for 24-72 h. After being stripped free 
of the PRAD-1 probe, the same blots were hybridized with 
32 P-labeled B-actin probe to normalize against possible 
variations in the loading or transfer of DNA. The autora- 
diograms were analyzed using a densitometer. Intensities 
of PRAD-1 /cyclin Dl were normalized to the p-aciin con- 
trol bands. The degree of amplification was calculated 
from these normalized values. Amplification was consid- 
ered when the signal of the tumor band was >2-fold the 
value of the matched normal mucosa [16]. 

Statistical analysis 

The Mann-Whitney non-parametric test was used to com- 
pare the Sis of pairs of subjects whereas the Kruskal-wallis 
was used for categorial data. Correlation between indices 
was performed using a simple linear regression test. The 
Kaplan-Meier method was used to create survival curves 
which were analyzed by the log-rank test. The impact of 
different variables on survival was determined using the 
Cox proportional hazards model, p. values less than 0.05 
were considered significant. 

Results 

The results of IHC are illustrated in figures 1 and 2. In gen- 
eral, the staining index (Sis) of all studied markers was 
higher in carcinomas than in normal colonic mucosal 
samples (p = 0.0001). Normal colorectal mucosa revealed 
positive imunostaining for Ki-67 in the lower half of the 
crypts only. A heterogeneous staining pattern was 
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Figure I 

Normal colonic mucosa showing positive nuclear immunostaining for: (a) cyclin D/, (b) ISH of histone H3 mRNA, (c) K7-67 and 
(d) cyclin A 



detected in the neoplastic cells of well and moderately-dif- 
ferentiated adenocarcinomas whereas a diffuse homoge- 
neous staining pattern was detected in poorly- 
differentiated carcinomas. The SI ranged from 10-40.2 
(mean: 24.6 ± 6.5). 

Immunostaining for cyclin Dl was predominantly nuclear 
but cytoplasmic staining was detected in some cases. 
However, unless a nuclear staining was also detected, 
cases with cytoplasmic staining were considered negative- 
Normal colorectal mucosal samples were almost negative 
for cyclin Dl whereas 41 out of the 60 (68.3%) CRC cases 
were positive. Marked heterogeneity was observed in well- 
and moderately-differentiated adenocarcinomas even 
within the same tumor. Poorly-differentiated carcinomas 
revealed a diffuse staining pattern with more darkly- 
stained nuclei. The SI ranged from 0.5-28.6 (mean: 9.3 ± 
4.2). 



Positive nuclear staining for cyclin A was detected in 80% 
(48/60) of CRC cases and in all no n -neoplastic control 
samples. Positively-stained nuclei were confined to the 
lower half of the crypts in normal colonic mucosa and dif- 
fusely-dispersed in carcinomas. The SI ranged from 3.3- 
30.2 (mean: 15.1 ± 6.6). 

Histone H3 mRNA was intensely expressed in the cyto- 
plasm of all examined samples either neoplastic or non- 
neoplastic. The distribution of histone H3 mRNA was 
similar to that of cyclin A and Ki-67 however, the propor- 
tion of histone H3 mRNA positive cells was less than thai 
of /Ci-6*7.The SI ranged from 1.8-24.2 (mean: 12.4 ± 5.3). 

The PRAD-1 probe detected 3 EcoRI fragments of 4.0, 2.2 
and 2.0 and 1 Bg/II fragment of 15 Kb. PRAD-1 /cyclin Dl 
gene amplification was detected in 22/50 (44%) cases 
analyzed. The degree of amplification was heterogeneous 
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Figure 2 

A case of well differentiated adenocarcinoma with positive immunostainlng for: (a) cydin 0/ T (b) histone H3 mRNA, (c) K/-67, 
and (d) cydin A. Another case of moderately differentiated denocarcinoma with positive immunostaining for; (e) cydin D I , (f) 
histone H3 mRNA, (g) and (h) cyc//n A. A case of poorly differentiated adenocarcinoma with diffuse staining for: (i) cydin 
D t . (j) ISH of histone H3 mRNA, (k) Ki-6 7 and (I) cydin A. 



with 2-10 fold increase when compared to normal 
mucosal samples (Figure 3). Amplification was confirmed 
by other restriction enzymes. 

Correlations 

There was a significant correlation between cydin DJ gene 
amplification and protein overexpression. Out of the 22 



cases that showed amplification 14 showed protein over- 
expression (concordance = 63.6%). 

Linear regression analysis of Sis revealed a significant cor- 
relation between Ki-67 and cydin Dl, cydin A, histone H3 
as well as between the Sis of cyclin A and histone H3 (p = 
0.008, 0.0001, and 0.0001 respectively) (Figure 4), There 
was a significant relationship between the SI of both Ki-67 
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Figure 3 

A: Southern blot analysis of normal mucosa (N) and their seven corresponding cases of colonic adenocarcinomas (TI-T7). 
cases No. 1 , 2, 4, and 5 are poorly differentiated whereas cases No. 3, 6, and 7 are moderately differentiated. Genomic DNA 
was digested with Bg/ll, fractionated by electrophoresis in agarose gel, transferred onto membranes and hybridized with PRADI 
and ficctin. Tumors number I-6 (Lanes I-6) show different degrees of PRAD Ucydin D / amplification, tumor number 7 (lane 7) 
was not amplified. B: Southern blot analysis of 3 cases of adenocarcinomas (T) and matched normal colonic mucosa (N). 
Genomic DNA was digested with EeoRt, fractionated by electrophoresis in agarose gel, transferred onto membranes and 
hybridized with PRADI and fiactin probes for loading control. The identification of the 3 tumors is the same as in Fig, 3 A with 
amplification of PRAD Ucydin D/ in tumors number 4, 5 (Lanes I, 2) but not 7 (Lane 3). 



and cyclin A and the degree of differentiation of tumors as 
well as the size of the tumor (p < 0.001 and p < 0,01 
respectively). In addition, SI of Ki-67 and histone H3 were 
higher in patients <50 years than in those £50 years (p < 
0.05) (table 1). 



In addition table 2 shows a significant relationship 
between high cydin Dl SI and large, poorly-differentiated 
tumors, carcinomas with positive lymph node metastasis 
and deeply-invasive carcinomas (p < 0.05, p < 0.001, p < 
0.05 and p < 0.05 respectively). Whereas cydin DJ gene 
amplification was significantly associated with an 
advanced disease stage since amplification was detected in 
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Figure 4 

Correlation between the staining intensity of (a) Ki-67 vs. cydin D/, (b) Ki-67 vs. histone H3, (c) Ki-67 vs. cyd/n A and (d) cyd/n A 
vs. histone H3 mRNA expression. 



10/15 (66.7%) of stage IV tumors compared to 12/45 
(26.7%) of stage Mil tumors (p = 0.002). Similarly, DNA 
amplification was detected in 60.5% (26/43) of the carci- 
nomas with extensive local invasion (beyond sm) but 
only in 23.5% (4/17) of the carcinomas with limited inva- 
sion (m, sm) (p = 0.001). A significant correlation was 
also present between cyclin Dl gene amplification and the 
presence of lymph node metastasis (p = 0.008) as well as 
between the SI of histone H3, the size of the tumor and the 
patient's age (p < 0.05, p < 0.001 respectively). The SI was 
higher in tumors >5 cm in diameter and in patients <50 
years. 

Survival analysis 

The mean follow-up period for all patients was 30 months 
(range: 1-66 months). Eighteen of 60 patients had 
already died by the time the study was completed. We 



defined the cutoff level for overexpression of each cell 
cycle marker at the point that showed the maximum dif- 
ference of survival rate between the 2 groups separated by 
that point. Cox regression analysis revealed that cyclin A 
overexpression (our definition: SI > 10.5), cyclin Dl over- 
expression (our definition: SI £ 6. 1 ), poorly differentiated 
histology, lymph node metastasis, TNM stage, tumor size 
and depth of invasion were all significant prognostic var- 
iables for survival (Table 3). The Kaplan-Meier survival 
curves for the subgroups of patients who are subdivided 
according to each marker's status are shown in Figure 5. 
Patient with tumors that showed Ki-67 overexpression 
(our definition: SI > 11.5) and histone H3 overexpression 
(our definition: SI £ 8.2) tended to have poor prognosis 
but this did not reach a statistically significant level, how- 
ever the overall survival was significantly lower in patient 
with cyclin A and cyclin Dl overexpression. Cox multivari- 
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Table t The relation between cycUn Dl overexpression vs cycUn Dl amplification and c St nicopat ho logical prognostic markers. 
Variables No. of cases CycUn Dl overexpression CycUn Dl Amplification 



Tumor size (cm) 



<5.0 


33 


5.3 ± 3.8* 


13/33 


£5.0 


27 


22.8 ± 7.2 p <O.0S 


9/27 p <0.236 


Histology 








G\ 


15 


6.6 ± 4.0 


7/15 


Gil 


21 


8.9 ± 3.6 


8/21 


Gill 


24 


22.0 ±8.1 p<0.00l 


7/24 p <0.O75 


Lymph node 








Negative 


33 


5.4 ± 5.3* 


6/33 (18.2%) 


Positive 


27 


20.6 ± 6.9 p <0.05 


16/27 (59.3%) p<0.008 


Depth of invasion 








m, sm 


17 


3.1 ±3.1* 


4/17(23.5%) 


beyond sm 


43 


IZ4±6.5p<0-05 


26/43 (60.5%) p«U>0/ 


Stage 








early 


45 


5.5 ± 10. 1 


12/45 (26.7%) 


late 


15 


1 1.3 ± 9.6 P = 0./75 


10/15 (66.7%) p<0.002 



Table 3: Uunlvariate analysis of the relationship between survival and the tested markers 
PredictiveVariables Median Survival HR CI P 



Kf-67 



<IL5 


36 








SI 1.5 


32 


1.826 


0.636-5.243 


0.26 


CyclinDl 










<6.l 


35 








£6.1 


18 


7.246 


1.007-45.150 


0.03* 


Hlstone H3 










<8.2 


35 








>8.2 


29 


4.639 


0.854-25.196 


0.07 


CycUn A 










<I0.5 


35 








SI 0.5 


15 


7.820 


1.017 -60.122 


0.02* 


Histological grade 










Low 


38 








High 


[0 


7.331 


2.696-19.940 


0.0001* 


Lymph node 










Negative 


38 








Positive 


15 


6.826 


1.973-23.621 


0.002* 


Stage 










1, II, III 


38 








IV 


12 


6.378 


1.842 -2Z083 


0,001* 


Tumor size (cm) 










<S.O 


35 








£5.0 


13 


4.835 


1.386-16.868 


0.0 1* 


Depth of invasion 










TI.T2 


36 








T3.T4 


20 


7.759 


1.024-58.789 


0.04* 


Age (years) 










<50 


38 








550 


28 


Z802 


0.988 - 7.943 


0.0526 


Sex 










Male 


38 








Female 


36 


0.696 


00.274-1766 


0.4449 



* p. value < 0.05 (significant) 

Hk Hazard Ratio 

CI: 95% confidence Interval 
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Figure 5 

Kaplan-Meier survival curves for colorectal carcinoma. Overall survival is significantly lower in patients with (a) cydin A and (b) 
cydin 01 overexpression. Patients with high SI for histone H3 mr\NA have poorer prognosis but this was not statistically signif- 
icant (c). No significant difference was present between patients with high Kh67 SI and those with low Kf-67 SI (d). 



ate regression analysis revealed that lymph node metasta- 
sis, cydin A and cydin Dl overexpression were 
independent negative prognostic factors after adjustment 
for the depth of tumor invasion, age and sex of the patient 
(Table 4). 

Discussion 

The proliferative activity of CRC ceils has been investi- 
gated in several studies either by immunohistochemical 
determination of cell proliferation index using antibodies 
to some types of cydins or by flowcy to metric determina- 
tion of the SPF of the cell cycle [8], Although Leach et al. 
[17] did not find cydin Dl gene amplification in a panel 
of 47 CRC cell lines; its protein was overexpressed in 
about 30% of CRC cases that were included in the studies 



of Bartakova et al. [6] and Arber et aL [18]. In the former 
study [6]cydin Dl was aberrantly accumulated in a 
significant subset of human CRC cases and the cell lines 
derived from these cases were dependent on cydin in their 
cell cycle progression. In the second study [18], overex- 
pression of cydin Dl was detected in 30% of adenomatous 
polyps indicating that overexpression is a relatively early 
event in colon carcinogenesis which is possibly responsi- 
ble for the pathological changes in the mucosa preceding 
neoplastic transformation. More recently, Holland et al. 
[19], Pasz-Walczak et al. [20] and Utsunomiya et aL [21] 
reported up-regulation of cydin Dl in 58.7%, 100% and 
43% of their studied cases respectively. 
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Table 4: Multivariate analysis oft he relationship between survival 
and thetested markers 



PredictlveVariabtes 


HR 


a 


P 


CydinDI 


10.864 


t.055- 86.250 


0.03* 


(baseline < 6. 1) 








CydinA 


13.886 


1.012- 190379 


0.0490* 


(baseline < 1 0.5) 








Positive Lymph node 


3.921 


1.057- 14.472 


0.0410* 


metastasis 








Stage iV 


3.411 


1.048- 12.083 


0.03* 


Depth of invasion 








T3.T4 


5.408 


0.449-65.080 


0.1836 


Age (years) 










1.996 


0.678 - 5.878 


0.2310 


Sex 


0.910 


0.315-2.358 


0.8453 



p. value < 0.05 (significant) 

HR: Hazard Ratio 

CI: 95% confidence Interval 



In the present study, up- regulation of cyclin Dl was 
detected in 68.3% of the cases. The SI was significantly 
higher in carcinomas than in normal colorectal mucosa 
and in poorly-differentiated adenocarcinomas it was 
approximately twice that of other histological types. 
Amplification and/or overexpression of cyclin Dl signifi- 
cantly correlated with deeply invasive tumors and positive 
lymph node metastasis. Our results in this regards are con- 
sistent with previous studies [8,22]. In 2001, Holland et 
al. [19]. demonstrated that deregulation of cyclin Dl and 
p21 wa f proteins are important in colorectal tumorigenesis 
and have implications for patient prognosis. Similarly 
McKay et al. [23] found that cyclin Dl was the only protein 
in their panel {cyclin Dl, p53, pI6, Rb-1, PCNA and p27) 
that correlated with improved outcome in CRC patients. 
However, few studies failed to detect any correlation 
between cyclin Dl overexpression and the 
clinicopathological factors in CRC [6,18]. This contro- 
versy in results could partially be explained by the differ- 
ence in the sampling of studied cases. The present study 
included 24 cases of poorly differentiated adenocarci- 
noma, which is not common in other studies of CRC in 
western countries. This was possible because the majority 
of CRC cases diagnosed in Egypt are of high histological 
grade (3 J. The correlation between cyclin Dl overexpres- 
sion and the high histological grade was also reported in 
other tumor types including non-small cell lung 
carcinomas [24] and squamous cell carcinomas of the lar- 
ynx (16]. Another possible explanation for the observed 
controversy in the results of different studies is the detec- 
tion method used. 

In the present work, overexpression of cyclin Dl was more 
common than gene amplification of the PRAD-1 /cyclin Dl 



gene with a 63.6% concordance. This was similarly 
reported by Bartakova et al. |6] who mentioned that there 
is a subset of CRC cases in which cyclin Dl is overex- 
pressed without PRAD-1 /cyclin Dl gene amplification. 
Consistent with this hypothesis are reports of elevated cyc- 
lin Dl mRNA levels and immunohistochemically detecta- 
ble accumulation of the protein in over one third of breast 
cancer cases at a frequency significantly higher than that 
deduced from DNA amplification studies |9,25]. These 
data imply that mechanisms other than gene 
amplification can also lead to deregulation and accumu- 
lation of cyclin Dl in solid tumors. 

So far, several studies were done to reveal the prognostic 
significance of cyclin Dl overexpression in various carci- 
nomas, including CRC [22]. However, these studies 
yielded conflicting results which could be attributed to 
organ heterogeneity. In our study, patients with tumors 
that exhibited cyclin Dl overexpression tended to have 
poor prognosis. 

It was reported that, patients with cyclin A positive carci- 
nomas had significantly shorter median survival times. 
Handa et al. [8] were able to detect cyclin A overexpression 
in 77% of their CRC cases. They also demonstrated that, 
cylcin A could be used as a prognostic factor of CRC. More 
recently, Habermann et al. [26] studied cases of ulcerative 
colitis with and without an associated adenocarcinoma 
for the presence of cyclin A overexpression. They found 
that, cyclin A overexpression was higher in cases of ulcera- 
tive colitis with adenocarcinomas than in those without 
adenocarcinomas. Consequently, they concluded that, 
cyclin A could be used for monitoring ulcerative colitis 
patients and for the early detection of an emerging carci- 
noma in this high risk group of patients. 

In our study, cyclin A was detected in 80% of the patients 
and Cox regression analysis showed that it could be used 
as a prognostic marker in CRC in addition to cyclin Dl, 

It would have been useful if we assessed the expression 
level of cyclin A by another technique (DNA 
amplification). This would have added more information 
regarding the gene status on one hand and confirmed the 
results of 1HC on the other hand. Unfortunately, this was 
not possible because in most of the cases included in the 
present work, the extracted DNA was not sufficient to 
study cyclin amplification after the assessment of cyclin Dl. 

In 1996, Nagao et al. [11] reported that histone H3 labe- 
ling index significantly correlated with ki-67 immunos- 
taining and was high in poorly differentiated human 
hepatocellular carcinoma. This was similarly reported in 
the present work since we found a significant correlation 
between the SI of histone H3 and Ki-67. However, no 
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statistically significant correlation was found between his- 
tone H3 SI and any of the studied clinicopathological 
factors. 

Although Ki-67 immunostaining reflects the proliferative 
activity of CRC, it has not been recognized as a significant 
prognostic factor in this type of tumors [27,28], However, 
Suzuki at al. [29] found a significant correlation between 
Ki-67 labeling index and local invasion of CRC. In the 
present study there was a significant relationship between 
the SI of Ki-67, tumor size and grade. However, Kaplan- 
Meier survival curves showed no significant difference in 
survival rates between patients with- and without overex- 
pression of Ki-67. 

Conclusions 

Our results demonstrate that cyclin Dl, cyclin A, histone H3 
and Ki-67 are overexpressed in a subset of CRC, however 
only cyclin Dl and cyclin A overexpression correlates with 
poor differentiation and tumor progression. This indi- 
cates the superiority of cyclin A and cyclin Dl as indicators 
of poor prognosis compared to Ki-67 and histone H3 
mRNA in CRC. Cyclin A and Dl could therefore be consid- 
ered significant, independent prognostic factors in CRC 
patients. These findings are especially important in stage 
II patients since 25-30% of those patients have poor prog- 
nosis in spite of being node-negative. However, the stand- 
ard clinicopathologic prognostic factors can not identify 
this subset accurately and therefore; there is a great 
demand for more accurate, individually-based, biological 
prognostic parameters that help in detecting this high risk 
group of patients who can benefit from an adjuvant ther- 
apy. If the findings of the present study are confirmed in a 
larger study, evaluation of cyclin A and Dl may be applica- 
ble to clinical management of CRC, allowing the identifi- 
cation of patients with poor prognosis. 
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Biology of the Cell at 302, emphasis added. Similarly, figure 6-90 on page 364 Elustra'tes the 
path from gene to protein. The accompanying text states that while potentially each step can be 
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S. J. Wysocki, E. Hahnel, A. Masters, V. Smith, A. J. McCartney and R. Hahnel (1990) 

Detection of pS2 messenger RNA in gynecological cancers. Cancer Research 50 1800- 
1802 



Patents 

Genetic Footprinting: Insertional Mutagenesis and Genetic Selection. U.S. Patent No. 
5,612,180. Inventors: Patrick Brown and Victoria Smith 

PATENTS EELED ( at Genentech Inc.) 

Methods of Detecting and Quantifying Gene Expression. Inventors: Victoria Smith, 
Edward Robbie, David Lowe, James Marsters. 

Compositions and Methods for the Treatment of Cancer. Inventors: Victoria Smith, 
Austin Gurney, Audrey Goddard, Fred DeSauvage 

Diagnostic for Dysplasia in Barrett's Esophagus. Inventor: Victoria Smith 

Numerous composition of matter filings related to novel gene discovery, pending 



EXHIBIT B 



Unq Id DNA Id Experiment Name 

Luna Tumor Samples 

851. 157,736. 100ngLungBaCa1069 vs 25ngEpi1409 

851. 157,736. lung 1431 vsepi pool 

851 . 157,736. lung SqCa R694 vs epi pool 

851 . 1 57,736. Lung SqCa-hf 1 649 

851. 157,736. Lung SqCa4if 1649 

851. 157,736. lung tumor 1055 vs epi pool 

851. 157,736. lung tumor 10ng 

851. 157,736. lung tumor 1370 vs epi pool 

851 . 135,920. lung tumor 1 647 

851. 135,920. lung tumor 1648 

851. 157,736. lung tumor 1685/ref.RNA 

851. 135,920. Lung Tumor 685 

851 . 157,736. lung tumor 688 vs epi pool 

851. 135,920. lung tumor 734 

851 . 1 35,920. lung tumor 735 

851 . 135,920. lung tumor 737 

851. 135,920. lung tumor 738 

851. 135,920. lung tumor 739 

851. 101,241. Lung tumor hf 842 

851. 135,920. Lung Tumor HF-0017083 

851 . 135,920. lung tumor hM291 

851 . 1 57,736. lung tumor hf-1 333 

851 . 1 57,736. lung tumor hf-1 340 

851. 157,736. lung tumor hf-1 364 

851 . 157,736. lung tumor hf-1 366 

851. 157,736. lung tumor hf-1587 

851 . 157,736. lung tumor hf-1 587 v normal 

851 . 1 57,736. lung tumor hM 587 v. normal 

851 . 1 35,920. lung tumor hf-1 596 

851 . 135,920. lung tumor hf-1 646 

851. 135,920. lung tumor hf-1 649 

851. 135,920. lung tumor hf-1 655 

851. 135,920. lung tumor hf-1 71 9 

851. 157,736. lung tumor hf-1 775 

851 . 157,736. lung tumor hf-1785 

851. 135,920. Lung Tumor HF1602 

851. 135,920. Lung Tumor HF1651 

851. 135,920. Lung Tumor HF1729 

851. 101,241. Lung Tumor HF631 

851 . 1 01 ,241 . Lung tumor hf840 

851 . 1 57,736. lung tumor R1 057 vs epi pool 

851 . 1 57,736. lung tumor R1 094 vs epi pool 

851 . 1 57,736. lung tumor R1 094 vs epi pool 

851 . 1 57,736. lung tumor R1 372 vs epi pool 

851 . 1 57,736. lung tumor R1 372 vs epi pool 

851 . 1 57,736. lung tumor R41 7 vs epi pool 



Raw Ratio Normalized Ratio 
(sample/pooled (sample/normal 
epithelial) lung) 



0.563 


0.990 


0.634 


1.115 


0.847 


1.489 


0.337 


0.593 


0.209 


0.367 


1.085 


1.908 


0.858 


1.509 


0.404 


0.710 


0.444 


0.781 


0.633 


1.113 


0.886 


1.558 


0.18 


0.316 


0.443 


0.779 


0.456 


0.802 


2.136 


3.756 


0.789 


1.387 


0.884 


1.554 


0.655 


1.152 


0.849 


1.493 


0.467 


0.821 


0.078 


0.137 


0.496 


0.872 


0.514 


0.904 


0.343 


0.603 


0.634 


1.115 


0.478 


0.840 


0.442 


0.777 


0.404 


0.710 


0.491 


0.863 


0.258 


0.454 


0.278 


0.489 


0.597 


1.050 


0.612 


1.076 


0.422 


0.742 


0.429 


0.754 


0.346 


0.608 


0.592 


1.041 


0.611 ' 


1.074 


0.627 


1.102 


0.579 


1.018 


0.523 


0.920 


1.004 


1.765 


0.813 


1.430 


0.808 


1.421 


0.739 


1.299 


0.431 


0.758 



851 . 1 57,736. lung tumor R542 vs epi pool 0.454 0.798 

851 . 1 57,736. lung tumor R543 vs epi pool 0.651 1 .145 

851 . 1 57,736. lung tumor R544 vs epi pool 1 .407 2.474 

851. 157,736. lung tumor R544 vs epi pool 0.655 1 .152 

851 . 157,736. lung tumor R685 vs epi pool 0.894 1 .572 

85 1 . 1 57,736. lung tumor R693 vs epi pool 0.836 1 .470 

851 . 157,736. lung tumor R693 vs epi pool 0.527 0.927 

851 . 157,736. lung tumor R737 vs epi pool 0.479 0^842 

851 . 157,736. lung tumor R742 vs epi pool 0.526 0.925 

851 . 1 57,736, lung tumor R777 vs epi pool 1 .041 1 .830 

851 . 1 57,736. lung tumor R777 vs epi pool 0.573 1 !o08 

851 . 1 57,736. lung tumor R789 vs epi pool 0.849 1 .493 

851 . 1 57,736. lung tumor R789 vs epi pool 0.707 1 .243 

851 . 157,736. lung tumor R791 vs epi pool 0.869 1 .528 

851 . 1 57,736. lung tumor R791 vs epi pool 0.71 1 1 ^250 

851. 135*920. Lung Tumor RNA689 0.209 0^367 

851 . 1 35,920. Lung Tumor RNA691 0.536 0^942 

.851. 135,920. Lung Tumor RNA693 0.441 0/775 

851 . 1 57,736. lung tumor vs epi pool 1 6.61 3 29^21 1 

851. 157,736. lung tumor vs epi pool 0.443 0/779 

85i. 157,736. lung tumor 15.387 27.055 

851. 157,736. lung tumor/Ref.RNA 0.761 1338 

851. 157,736. lung tumor1582/Ref.RNA 0.92 1^618 

851 . 1 57,736. lung tumorl 683/epipool 1 .35 2.374 

851 . 1 57,736. lung tumorl 685/epipool 0.707 1 ,243 

851. 157,736. lung tumorl 853/REF.RNA 1.491 £622 

851. 101,241. lung tumorhf641 0.911 1^602 

851. 157,736. LungBaCavs Epi 0.342 o!601 

851 . 1 57,736. LungBAcal 662 vs Epi 3.362 5^91 1 

851. 157,736. LungBAca 2.466 4.336 

851. 157,736. LungCaSq-hf1602 0.713 1.254 

851. 157,736. LungCaSq-hf1647 0.885 1 556 

851. 157,736. LungSqCa-hf1293 1.291 2270 

851. 157,736. LungSqCa-hf1646 0.411 0/723 

851 . 1 57,736. LungSqCa-hf ? 1.112 1 ^955 

Normal Luna Samples 

851. 157,736. normal lung 795 0.221 0.389 

851. 157,736. normal lung hf-1 773 0.516 0.907 

851. 157,736. N. lung R1415 vs univ. ref 0.991 1 .742 

851 . 1 57,736. N. lung R1 431 vs univ.ref 0.709 1 !247 

851. 157,736. N. lung R417 vs univ.ref 0.669 1.176 

851 . 1 57,736. lung Normal 1 052 vs epi pool 0.603 1 .060 

851. 157,736. lung Normal 1417 vs epi pool 0.317 0.557 

851 . 1 57,736. lung Normal R417 vs epi pool 0.982 1 .727 

851. 157,736. lung Normal R419 vs epi pool 0.335 0.589 

851. 157,736. 423LunglnfIamTA1 0.517 0.909 

851. 157,736. 488-551 LunglnflamTAI 0.486 0.855 

851 . 1 57,736. 488-552lnf ImdLungTAI 0.425 0 747 

851. 157,736. 1054LunglnflamTA1 0.56 0985 



157,736. 1415LunglnflamTA1 0.706 
157,736. 1415LunglnflamTA1 0.494 



EXHIBIT C 



PERSPECTIVES 



mmmm 

Expression profiling — best practices 
for data generation and interpretation 
in clinical trials 

The Tumor Analysis Best Practices Working Group* 



Microarrays am routinely used to assess 
mRNA transcript levels on a genome-wide 
scale. As use and acceptance increases, 
there is intensified focus on appropriate 
methods of data generation and 
interpretation, with Important questions 
being asked about the best data analysis 
methods. The development of such 'best 
practices' Is needed, as microarrays - in 
particular, Affymetrix oEgonucleotlde arrays 
— are becoming increasingly important 
In human clinical trials, both for 
differentia] diagnosis and monitoring 
of pharmacological efficacy. Here, 
representatives from high-volume 
microarray core centres consider the 
current status of 'best practices 1 , 
focusing on the broadly used Affymetrix 
oligonucleotide arrays. 

Microarrays represent a major technological 
advance in molecular biology. The introduc- 
tion of any such advance is typically followed 
by a period of optimization and standardiza- 
tion. The latter is a crucial part of any maturing 
technology, as it allows an approach in which 
advances are made in parallel by individual 
researchers and companies who contribute 
new knowledge based on the existing stan- 
dard. Any such standards must be constantly 
reassessed; stale or stagnant standards can 
inhibit the development of the technology. 

Microar ray-based mRNA- expression pro- 
filing can be considered to be the first mature 
genome-wide analysis technology, reflected in 
an increased interest in using microarrays as 
an endpoint in clinical trials. However, regula- 
tions of clinical trials require the development 
of clear standards for use and interpretation of 
microarray data (commonly referred to as 
quality control and standard operating proce- 
dures (QC/SOPs) and/or 'best practices'). 
Guidelines for reporting and annotation of 
microarray data from the Microarray Gene 
Expression Data (MGED) Sodcty (see online 
links box) — using MIAME (Minimum 
Information About A Microarray Experiment) 
standards (box J) and the MAGE-ML mark-up 



language 1 - 1 — represent an important step 
towards this goal. The efforts of this multina- 
tional academic-industry partnership has 
made it possible to develop databases that 
can house the many types of microarray data 
(see below) within the same data struc- 
ture, enabling some data queries between 
experiments and experimental platforms. The 
ArrayExpress microarray database 3 (see online 
links box) is the first major publicly accessible 
database that adheres to this universal data- 
presentation platform, and some prominent 
journals (such as Nature, Cell, EMBO Journal 
and The Lancet) now demand that published 
microarray data conform to the MIAME 
standards. In addition, microarray manufac- 
turers, such as Affymetrix, have implemented 
MIAME- compliant data output in their new 
software releases. 

The MGED Society has effectively devel- 
oped data-reporting guidelines, but it has 
not addressed issues of data generation and 
interpretation. The latter are more intimately 
coupled to the specific experimental plat- 
form. Of the three commonly used types of 
microarrays (spotted cDNA, spotted oligonu- 
cleotide and Affymetrix arrays), each has dis- 
tinct methodologies associated with them; 
accordingly, the issues of data interpretation 
are also different (BOX 2). These differences 
make it difficult or impossible to develop 
cross-platform guidelines for data generation 
and interpretation. Best practices for spot- 
ted cDNA arrays are especially problematic 
because the manufacture of the arrays varies 
considerably from place to place. In addition, 
all spotted arrays use co-hybridization of a 
test RNA sample labelled with one colour 
njuoftorHOKE with a control RNA labelled with a 
different colour to which the test is compared 
on the same spot. The output is in the form of 
a ratio of hybridization signals that is compa- 
rable to other experiments only if the same 
control RNA is always used. Therefore, the 
development of standards in spotted arrays 
would require all laboratories to use the same 
control RNA solution before data could be 
easily compared 



Manufactured oligonucleotide arrays 
(both mechanically spotted and synthesized 
in situ) have the advantages of being centrally 
produced under controlled conditions. 
Airymetrix morouiTiocxAPHy-produced arrays 
have been available for nearly 10 years, 
whereas mechanically spotted oligonucleotide 
arrays have only very recently begun to 
appear in the marketplace. For example, 
Agilent Technologies (see online links box) 
recently released 17,000 60-mer oligonu- 
cleotides printed five times each on glass 
slides (85,000 features). Spotted oligonu- 
cleotide arrays typically have a single spot per 
gene (single probe measurement), whereas 
Affymetrix arrays provide multiple measure- 
ments — a series of independent or semi- 
independent oligonucleotides query each 
RNA in solution (the probe set) (BOX 2). 
Affymetrix probe sets arc constructed from a 
series of perfect-match and paired-mismatch 
oligonucleotides, allowing some assessment 
of non-specific binding and performance of 
the probes. Overall, the Affymetrix probe sets 
provide a variety of measurements that allow 
robust measures of gene expression. The use 
of multiple perfect-match and mismatch 
probes for each gene enables the development 
of different methods of interpreting the 
hybridization patterns across the probe set 
and calculating a single 'expression level* or 
'signal* that reflect the gene*s relative expres- 
sion level. A number of probe-set interpreta- 
tion algorithms for Affymetrix arrays are 
available (see below for discussion). 



"(distinct methodologies] 
make it difficult or 
impossible to develop 
cross-platform guidelines 
for data generation and 
interpretation," 



The increasing use of Affymetrix micro- 
arrays, and the emergence of this technology 
as an endpoint in clinical trials, has led to 
requests to develop, in both the pharmaceuti- 
cal and academic research communities, best 
practices in data generation and analysis. 
Given the many differences between spotted 
cDNA, spotted oligonucleotide and Affy- 
metrix arrays, the best practices need to be 
developed separately for each experimental 
platform; this is in contrast to data reporting 
that can be standardized across ail platforms 
(BOXES I and 2). The Tumor Analysis Best 
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Box 1 [Tha fiilAME guidelines for data reporting F ^ 

The Microarray Gene Expression Data Society (MGED) is ^international discussion group of 
microarray experts, with the primary goal of developing methods for data shirring between 
experimental platforms. The mam output oOhis group has bero the Minimum Inform,? ti on 
About A Microarray Experiment (MIAME) guidelines for microarray da?a annotation an d • : : 
' reportmg. The guidelines have been adopted by a number of scientific journals and have recently 
been endorsed for use by the US Food and DmgAdxrunistration and the US Department of 
Agriculture for pharraacogenomics projects, - 

The MIAME guidelines include descriptions of experimental design (number of replicates, ■ 
nanii^of Jnolq# 

procedures an d p anmt eters, an d measurem en t data and spedfiratipns. These guidelines have been 
-most mroortantfor the sported cDNA and o^Hgonudeobdee^ in _ 

w&dvtkefle^ 

array date generation and porting between different iaborato5es.-The guidelines do not attempt 
to dictate how experiments should be done but rather provide adequate Information associated 
with any published or publi dy available experiment so that the "experiment can be reproduced. - 



Box 2 1 Microarray experimental platforms 

There are three different types of microarray in common use: spotted cDNAs, sported 
oligonucleotides and Afrymetrix arrays. 

Sported cDNA arrays 

Spotted cDNA arrays typically use sets of plasmids of specific cDNAs in gridded Liquid aliquots. 
The inserts of each done are typically amplified by PCR, and a few picoli tres are physically 
spotted onto glass slides by liquid-handling robots. Robotic spotters can spot 100,000 spots per 
slide, and duplicate sets of dones are often spotted. The advantages of spotted cDNA arrays are 
that the content of each microarray is determined by the researches with complete flexibility in 
number and type of cDNA dones spotted. Also, the cost per array is relatively low. as the done 
sets are a PCR-renewable resource and the glass slides are themselves inexpensive. The amount of 
the RNA that corresponds to each spot is determined relative to a second control RNA solution 
that is hybridized to the same spot, and a ratio is obtained. 

Disadvantages of spotted cDNA arrays indude the variable amount of DNA spotted in each 
spot, the 10-20% 'drop out* rate of failed PGR reactions or failed spots and mis-identification of 
dones {that is, the spot Is not what you think it is). Also, there is no control over the actual 
sequence of the done. As many gene-coding sequences contain regions of sequence that are 
shared with other genes, there are questions of specificity of the hybridization to the relativdy 
large cDNA inserts. Spotted cDNA arrays were embraced by most academic centres, owing to 
their flexibility and relatively low cost. 

Spotted oligonudeotide arrays 

These arrays are also built by liquid handling on glass slides; however, the input solution is a 
synthetic oligonudeotide (often 60-70-mers). The resulting spotted material is typically of 
known concentration, of known sequence and is single stranded (all advantages relative to 
spotted cDNAs). Most of the process can be automated, leading to less sample mix-up and less 
drop-oat of samples. 

Disadvantages of spotted oligonucleotides indude the relatively high cost of synthesizing large 
numbers of large oligonucleotides and the non-renewable nature of the resource. Spotted 
oligonudeotide arrays are becoming increasingly available. 

Afrymetrix GeneChips 

These micro arrays are factory designed and synthesized. Design is done using software to 
choose a scries of 1 1 25-mer probes from the 3' end of each transcript or predicted transcript 
in the genome; each of the 1 1 probes is then paired with a similar mismatch probe that is designed 
to contain a mutation in the centre. The latter serves as a form of control for hybridization 
specificity. Synthesis of arrays is done using light-activated chemistry and photolithography 
methods, and feature size can be reduced to approximately 8 um', with about 1 million probes in a 
1 2 cm 3 glass area. Probe-set algorithms interpret the signals from each 22 -oligonucleotide probe 
set, and derive a single value (signal) from the patterns of hybridization to the 22 individual probes. 
This signal is then normalized to the entire microarray, or to the probe sets across an entire project 

For a more general discussion of normalization and analysis methods of different microarray 
platforms, the reader is referred to the excellent web information resource of the MGED group 
(see The MGED Data Transformation and Normalization Working Group in online links box). 



Practices Working Group (see box 3) was 
convened to discuss and devdop best prac- 
tices for Afrymetrix microarrays, including 
QC and SOPs for both data generation and 
data analyses. The first meeting was held in 
Santa Clara in March 2003, followed by a 
series of conference calls that focused on dis- 
cussions of data generation and analysis stan- 
dards for the Afrymetrix oligonucleotide 
arrays. The Working Group deliberately 
focused on a platform that has widespread 
usage and is most likely to be used in clinical 
trials owing to the previously standardized 
manufacturing process. Here, we discuss rec- 
ommendations for experimental design, 
probe-set analysis algorithms, signal/noise 
assessments and biostatistical methods. 

Experimental design 

Appropriate experimental design is a key 
aspect of all science, and microarray studies 
arc no exception. The relatively high cost of 
some commercial microarray platforms is a 
frequently dted reason for suboptirnal experi- 
mental design, especially with regards to the 
number of replicates. Data interpretation is 
inevitably compromised when replicates are 
decreased. 

Replication in cross-sectional studies. The 
appropriate number of microarray replicates 
for any particular condition or time point 
depends on the source of biological variability 
in the study samples. Inter-individual variabil- 
ity is very large in outbred (genetically hetero- 
geneous) humans, but is very small within 
inbred mouse strains. For example, expres- 
sion profiles derived from muscle from differ- 
ent mice are not more variable than from 
musdes isolated from one mouse 4 . Defining 
the confounding variables that contribute to 
experimental variability, such as intra-subject, 
inter-subject, inter-group and technical vari- 
ation (microarray protocol), is needed to 
design and statistically power a study, and 
to determine the number of replicates that are 
needed. In general, inbred mice require test- 
ing only three or four mice per group. We and 
others have found that five or six out-bred 
rats per group provide statistically robust 
results 5 - 6 . By contrast, human samples require 
considerably more individuals per group. Key 
variables in human samples include tissue 
heterogeneity, stage of disease and inter- 
individual variation, all of which have been 
found to be major confounding variables 7 . 

Replication in longitudinal studies. It has long 
been recognized that, in human clinical trials, 
LONcrruDiNAL designs provide considerably 
greater power at lower numbers of replicates. 
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Box 3 1 The "Himor Analysis Best Practices Working Group* 

The Tumor Analysis Best Practices Working Group is a group of investigators who study the best 
practices of tumour analysis in humans taldngpart in clinical trials. The following authors are 
members of the Group: 

• Eric P. Hoffinan is at the Research Center for Genetic Medicine, Children's National Medical 
Center, Washington DC 20010, USA. email: ehoffinan@cnmcrcsearch.org 

• Tarif Awad, John Palma, Teresa Webster, Earl Hubbell and Janet A. Warrington are at Affymetrix, 
Santa Clara, California 95051, USA. emails: tarif_awad@arrymetrix.com; 
Jolm_palma@afjymetrix.comi teresa_webster@afrymetrix.com; earlJiubbeU@afxymetrix.com; 
janet.warriogton@afrymctrix.com 

• Avrum Spirals at The Pulmonary Center, Boston University Medical Center and the 
Bioinfbrmatics Program, Boston University, Boston, Massachusetts 021 18, USA* e-mail: 
aspira@lung.bumcbu.edu 

■ George Wright is at theBiometric Research Branch, Division of Cancer Treatment and 
Diagnosis, National Cancer Institute, National Institute of Health, Bethesda, Maryland 20392, 
USA. e-mail: wrightge@m ail. nih. go v 

• Jonathan Buckley and Tim IHche are at the Children's Hospital, University of California, Los 
Angeles, California 90089, USA e-mail: buckley@hscusc.edu; triche@hscuscedu 

• Ron Davis, Robert Tibshirani and Wenzhong Xiao are at Stanford University, Palo Alto, 
California 94303, USA. e-mails: dbowe@stanford.edu; tibs@stat.stanford.edu; 
wzxiao@pmgm2_tanford.edu 

■ Wendell Jones is at Expression Analysis Inc, Durham, North Carolina 27713, USA. e-mail; 
wjones@expressionanalysis.com 

■ Ron Tompkins is at Harvard University, Boston, Massachusetts 021 15, USA. e-mail: 
rtompkiiis@par_iers.org 

• Mike West is at the Institute of Statistics and Decision Sciences, Duke University, Durham, 
North Carolina 27708, USA. e-mail: mw@stat.duke.edu 



They best control for inter-individual varia- 
tion because each subject serves as their own 
control. Serial blood sampling from single 
subjects is the least invasive 1 (see below for 
further discussion), and, for example, cancer 
patients axe often longitudinally sampled 9 . 
Serial biopsies of other tissues are more inva- 
sive: however, a number of serial human mus- 
cle biopsy studies of healthy volunteers after 
different types of exercise training have begun 
to appear in the literature 10,11 . 

Expression profiling of blood samples 
(longitudinal or crosssectionai design) is the 
protocol that is most likely to be used in 
human clinical trials. One of the Working 
Group's goals was to establish SOPs for blood 
sample collection and RNA isolation in clini- 
cal trials. A specific follow-up report of these 
recommendations will be published else- 
where. Such a protocol must be easily adapt- 
able to multiple trial sites, with relatively little 
need for resident expertise to carry out the 
isolation protocol- So far, standard methods far 
isolating peripheral-Wood mononuclear cells 
have shown the most reproducibility, although 
others are being tested (see Asymetrix Tech- 
nical Note in online links box). Cells isolated 
soon after collection can be flash frozen for 
storage and subsequent RNA isolation or an 
RNA stabilizing compound can be added if 
the samples need to be transported. 

Tissue/ cell heterogeneity. Tissue heterogeneity 
is a major confounding variable in most micro- 
array experiments. In inbred mice, tissue het- 
erogeneity is typically normalized by using 
whole organs. This is rarely possible in human 
experiments, and particularly not in clinical 
trials; the limited amount of human tissue 
that is available exacerbates heterogeneity. The 
mixed cell populations of peripheral blood 
can bethought of as a tissue heterogeneity 
problem similar to that encountered in all 
solid tissue and tumour biopsies. Indeed, a 
recent study showed that variation as a result 
of tissue variability in human musde biopsies 
often exceeded inter-incUvidual variability 12 . 
One potential solution to the tissue hetero- 
geneity problem lies in bioinformatic meth- 
ods. If computer software can be trained to 
recognize the expression profiles of each indi- 
vidual cell type within a mixed tissue sample, 
then it should be possible to subtract them 
from each other and to renormalize to obtain 
a set of cell-specific expression profiles derived 
from a single mixed profile This will be most 
easily done on tumour biopsies, in which the 
main cells of interest are tumour versus conta- 
minaong normal tissue. Although there are no 
published examples so far, such methods are 
maturing rapidly. 



An experimental alternative to mitigate 
confounding tissue heterogeneity is to isolate 
pure cell populations for expression profiling. 
Many such methods are well developed in the 
research laboratory, including fluorescence- 

ACTIVATED CELL SORTING ( FACS) 1 NEGATIVE CELL 

isolations from blood (for example; Stem Cell 
Technologies RosetteSep) 14 and laser capture 
microdissection 15 . To research scientists, the 
profiles that are derived from isolated cell types 
are a more intuitive approach to define biologi- 
cally relevant pathways. However, it should be 
noted that uses of array-based analysis of gene 
expression approved by the US Food and Drug 
Ad'ministration (FDA) will probably focus on 
reproducibility and robustness (as well as on 
predictive accuracy), rather than on biological 



"If computer software can 
be trained to recognize the 
expression profiles of each 
individual ceil type within a 
mixed tissue sample, then it 
should be possible to subtract 
them from each other. . . " 



interpretation or justification. The high-tech 
methods used to isolate specific cell types from 
clinical samples arc unlikely to make their way 
into clinical trials unless tissues are procured in 
a highly centralized way. 

Procedural variation. Beyond the usual issues 
of sampling and accrual, gene- expression data 
will be subject to many additional sources of 
error. For example, the surgical removal and 
processing of tumour tissue can vary consid- 
erably from site to site. Laboratory QC proce- 
dures in tissue handling, RNA extraction and 
processing, and variations in protocols for 
data management and processing will need to 
be addressed in any clinical trial design. In 
particular, prolonged tissue ischaemia prior to 
processing of surgically resected tissue can sig- 
nifjcandy alter gene expression 16 . All tissue 
samples should be flash frozen within min- 
utes of surgery and stored at -80°C or below. 
Samples should also be kept in small, airtight 
containers and kept from drying out during 
frozen storage by placing fragments of ice in 
with the sample. 

Technical variability 

The standard laboratory protocol for generat- 
ing UNA profiles using Asymetrix microarrays 
involves a series of steps (FIG. I). 
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Figure 1 1 Sample processing and micro array 
Interpretation of Asymetrix GeneChlps. Rash- 
trozen tissue (-50 mg) Is homogenized to Isolate 
total RNA SInete-strandod <ss) cONA and then 
doube-stranded (ds) cONA is made from -5 pg 
of total RNA. Double-stranded cDNA contains a 
T7 RNA-pdyrnerase promoter adjacent to the 
3" pofyAtaa of each transcript. It is transcribed 
in vitro to generate mora than 400 biotiny!ated 
cRNAmctecutes for each ds cONA motaajle. The 
biotinylated cRNA Is fragmented and hybridized 
to the mtoroatrays. Each transcript is queried by 
one or mora probe sets of 1 1 perfect-match end 
1 1 paired -mismatch oEgonucteoUdes (the latter 
contain a centrally located point mutation as a 
form of hybridization specificity control). Currently 
available Afrymetrfat microarrays have -54,000 
probe sets on each 1 .28 cm* glass microarray 
(-1 .2 miEon 25-mer oligonucleotides on the 
HG-U1 33Plus 2.0 array). The btotinylated 
cRNA fragments hybridize to the appropriate 
oligonucleotide features. A laser scanner determines 
the amount of bound bbtinyfalod cRNA Indirectly 
through the streptavidin- conjugated phycoerythrin 
fluorescence at each feature wfthin a probe set. The 
component probe pairs are interpreted end 
averaged to arrive at a single signal that reflects the 
relative abundance of the original mRNA Probe 
sets are interpreted by any one of a number of 
probe-set algorithms, each providing a signal that 
reflects the relative hybridization intensity across the 
probe set RT. reverse transcriptase. 



RNA isolation. RNA quality and quantity is 
crucial to the success and reproducibility of 
the expression profiles. RNA quantity and 
quality is generally checked by complemen- 
tary methods: UV 260/280 ratio >1.8, agarose 
gel electrophoresis or an Agilent Bioanalyzer 
to visualize dear 18S and 28S ribosomal RNA 
bands. Total RNA (5-10 pg) is input into the 
cDNA/cRNA reaction, with an expected cor- 
rected yield of biotinylated cRNA of between 
4- and 10-fold greater than the total RNA 
input (so 5 pg of total RNA must yield at least 
20 pg of biotinylated cRNA, or the sample is 
discarded). The biotinylated cRNA should be 
500-3,000 base pairs (bp) in size. After frag- 
mentation, the cRNA should be 50-200 bp. 
The Working Group recommends that sam- 
ples that do not meet these criteria should be 
discarded. 

If RNA amount is limiting — as is the case, 
for example, with laser capture microscopy 
samples, flow-sorted cell samples or small tis- 
sue samples — a two-round amplification 
protocol can be used. For example, 200 ng of 
total RNA is processed for in vitro transcrip- 
tion (IVT), with the same goal of 4- 10- fold 
amplification (>800 ng of cRNA output). 
One hundred nanograms of this cRNA is 
then reverse transcribed into cDNA using 
random primers, after which a second IVT is 
done. The second round IVT should result in 
a 400- fold amplification. 

Microarray controls. Hybridization controls 
include visualization of the image so that any 
abnormalities in hybridization patterns can be 
detected ProbeProfiler from Corimbia Inc. is 
a program with extended capabilities for 
detecting defects in microarray manufacture. 
Afrymetrix MAS 5.0 software adjusts the 
microarray-scanned image to a common tar- 
get intensity by using a scaling factor. In addi- 
tion, a general index of chip background and 
noise is represented by the percentage of 'pre- 
sent calls' (probe sets for which the hybridiza- 
tion to the perfect-match probes is significantly 
higher than mismatch hybridization). The 
Working Group believes that both the scaling 
factor and the percentage of present calls arc 
important QC criteria. Considering MAS 5.0 
chip analyses, the scaling factors to normalize 
chips within a project should lie within two 
standard deviations of the mean, with present 
calls being greater than 25% (box The per- 
centage of present calls is often lower when 
B or C arrays that contain higher propor- 
tions of more poorly characterized transcript 
units (expressed sequence tags or computer- 
predicted open reading frames) are used. The 
percentage of present calls across a set of sam- 
ples should be consistent, within a range of 



10%. Some software packages allow the iden- 
tification of statistical 'outlier* microarrays in 
a group of microarrays in a given project, 
which additionally enables the experimenter 
to flag and exclude specific microarrays that 
are not acceptable for an analysis. In addition 
to these criteria, acceptable hybridizations 
must have adequately intact input RNA as 
shown by 3' to 5' ratios of hybridization within 
probe sets. A typical control is the gfyteralde- 
hyde 5- phosphate dehydrogenase (GAPDH) 
gene, which should have 3' to 5' ratios of less 
than 3 (BOX 4). 

The QC criteria provided above are based 
on MAS 5.0 probe-set algorithms and data 
analyses. The measures of present calls and 
scaling factors are useful and serve as initial 
summary measures of the performance of a 
particular microarray. However, more focused 
statistical methods, coupled with routine 
visual inspection of images, hold promise for 
the continuing improvement of data quality 
and screening abilities. 

Large-scale analyses of microarray data 
across laboratories have not yet been reported. 
However, the Working Group feels that adher- 
ence to the above QC criteria, using standard 
RNA isolation and processing methods, should 
yield data that are consistent between laborato- 
ries and intrinsically comparable. The same 
set of criteria can also be used as best practices 
for data generation in the design and conduct 
of clinical trials. 

Standard clinical laboratory practice is 
to develop programmes for submission of 
known samples to different laboratories 
and assessment of comparability of results. 
Such programmes are under development 
within larger collaborative efforts, such as 
the National Heart, Lung and Blood Institute 
(NHLBI) Programs in Genomic Applic- 
ations (see the HOPGENE Program for 
Genomic Applications in online links box) 
and the National Institute of General Medical 
Sciences (NIGMS) Glue Grant (see online 
links box). 

Data analysis and Interpretation 

Signal generation versus statistical analyses. 
Two relatively distinct steps underlie all 
data analyses of Aifymetrix oligonucleotide 
microarrays: the development of a normal- 
ized 'signal* for each transcript on each 
microarray and the subsequent statistical 
analysis of differences in signals between dif- 
ferent arrays. The first step involves probe- 
set algorithms that use all, or part, of the 
component signals within a probe set and 
then derive a single signal that is representa- 
tive of the relative abundance of each mRNA 
queried in each array. The second step is the 
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Box 4 [ Quality Control metrics for Affymetrij microanraya 

RNA quality 

Optical density 260/2S0 of 1.8-Z1 I Agilent Bio-Analyzer I Gel electrophoresis 
cDi^dP^ efficiency '" 

>4-foId amplification from total UNA 1 500-3,000 bp prior to fragmentation 1 50-200 bp after 
fragmentation 

Chip hybridization 

Image inspection for defects I Scaling factors within two standard deviations within a project I 
MAS 5.0 present calls >25% for the a-series arrays, including the HG-U133Plus 2.0 array I 
Percentage present calls for the Br and C-serjes arrays are typically lower l3*/5* GAPDH 

'rafios'<3:0 ; . ; • \l r : - -•: : " ^ !" .-. ':■ ' :\ 

]Pr$ertii<nT^^ 

The detection of statistical oudiers for chips, probe sets or individual probe pairs requires 
normalization and analysis across an entire project This is afforded by the dCHIP and 
ProbeProfiler, and other software packages Data- analysis packages that rely on intra-chip 
norraalizauon and scaling typjcally do not enable detection of statistical outliers.' 
Chip outliers 

Probe-set outliers I Probe-pair outliers I Range in present calls < 10% 



application of bioinformatic and statistical 
methods to identify interesting subsets of 
the assembled data of alt arrays within a 
project. There is considerable debate about 
the best methods for both of these steps (see 
below for a discussion). Although the two 
steps are separable, it is clear that they have a 
marked influence on each other. It is in this 
realm that the bioinformatics of microarrays 
becomes avant-garde, and with the ground- 
breaking nature of research comes consider- 
able debate as to what is appropriate in any 
specific situation. 

Before discussing the different methods for 
probe-set analysis and data interpretation, it is 
important to point out that much of the 
debate in the field of bioinformatics about 
imaoarray interpretation revolves around sig- 
nal/noise ratios, A common assumption is that 
signal/noise ratios across a micro array are 
homogeneous, or at least similar in magnitude. 
This might be true for general background 
hybridization, but not for the performance of 
probe sets. In any particular microarray, there 
are probe sets that give very strong and clear 
hybridization patterns and those that perform 
poorly Many of the bestperfbrniing probe sets 
(those with a highly significant probe-set 
detection p value) reflect highly expressed tran- 
scripts with no closely related sequences that 
might cross-hybridize. Low-level transcripts, or 
transcripts that belong to gene families with 
highly homologous sequences derived from 
distinct genes, often have corresponding probe 
sets that do not perform as well and might 
have a significant, if not overwhelming, noise 
component. The signal from such probe sets 
is difficult to interpret, and data interpreta- 
tion can be limited to only the best perform- 
ing probe sets, although arguably the most 
interesting data comes from the genes that 
are expressed at low levels but that still show 
significant differences between samples. 

Determining adequate sensitivity of the 
signals and signal/noise responses relative to 
the absolute quantity of mRNA in clinical 
samples is crucial as microarrays become a 
component of clinical trials and diagnostic 
models. Affymctrix arrays provide a concen- 
tration of each mRNA queried relative to the 
genome-wide mRNA profile of the sample; it 
is assumed that the global mRNA content of a 
tissue as a whole does not change significantly; 
making relative mRNA quantification an 
accurate reflection of the response of the indi- 
vidual gene. This method differs from absolute 
quantification of specific mRNAs (such as 
si nucpase protection and REAt^TiME pcr), or the 
isolated transcript ratio determined by co- 
hybridization of two samples to spotted 
cDNA or oligonucleotide arrays (box 2). 



Aflyrnetrix arrays achieve considerable sensi- 
tivity through the inherent redundancy of 
the probe set; however, the Working Group 
acknowledged that some genes, such as some 
cytokines that are functional at very low 
expression levels, are probably below the limit 
of detection. 

The Working Group agreed that each 
project will have its own signal/noise opti- 
mum, and analysis methods that prove best 
for one project might prove unsuitable for 
another. Ideally, a signal/noise ratio should 
be optimized for each project or trial using 
different probe-set algorithms and data- 
filtering methods, and some systematic 
efforts towards this end are beginning to 
appear in the literature 17 . 



"... adherence to the above 
QC criteria . . . should yield 
data that are consistent 
between laboratories and 
intrinsically comparable." 



After a signal is derived for each probe set, 
data is interpreted using statistical and visual- 
ization methods. All statistical methods run 
into two generic problems when faced with 
microarray data that are inter-related. The 
first is the curse of dimensionality — each 
gene is potentially related to every other gene, 
so all permutations of all available data must 
be considered, leading to an exponentially 
increasing number of possible associations in 
multidimensional space. The problem arises 



when associations (samples) become lost as 
the dimensionality increases — associations 
lose their local value and become gcnericaUy 
global in statistical terms. Statistical models 
attempt to circumvent this curse by requesting 
larger and larger sample sizes, but fulfilling 
the requests becomes functionally impossi- 
ble for the experimentalist. There is no easy 
answer to these problems and they remain a 
challenge for future bioinformatics research 
that uses microarrays ,, . 

Derivation of signal; probe-set algorisms and 
normalizations. One of the key advantages of 
the Affymetrix platform is the multiple mea- 
surements that are intrinsic to the probe set 
— most probes include 1 1 perfect- match and 
1 1 paired-mismatch 25-bp oligonucleo- 
tides per gene (fig. i). Previous versions of 
GeneChip arrays used probe-set design meth- 
ods that led to considerable overlap between 
probes, so that hybridizations to each fea- 
ture/probe were not independent measure- 
ments; this led to considerable uncontrolled 
weighting of the contribution of any particu- 
lar region of sequence to the resulting signal. 
All recent chips use a much more refined 
probe-set design with less overlap and consid- 
erably better performance of the probe set. 
Improvements in array and probe- set designs 
have been accompanied by an evolution in 
primary analysis algorithms and the support- 
ing software provided by Aflymetrix for data 
analysis and interpretation 19 . Aflymetrix 
default algorithms are based on well-docu- 
mented statistical methods, namely the robust 
TUKEV5 Bt-wHorr estimator and WIXXOXOJTS SIGNED 
rank, to calculate the final probe-set signals 
and associated p values, respectively 1 '' 20 . 
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Table 1 1 Comparisons of probe-set analysis algorithms 

Algorithm . . . Penalty for - - : Normalization method Qutljer detection- Sensitivity* ■ Specificity* 
mismatch signal ~~ - ' -and correction-,- " " 

Affymetrix MAS 5.0 High Individual chfps Uttle Good Excellent 

ojaHBdtfference". " High . (^oss-project . - .Moderate / /. Good - 'Excellent 

<*CHlP None Cross-project Moderate Excellent Good 

RMA . None Cross-project " Moderate Excellent : Good 

ProbeProfiler Moderate Extensive Extensive Good Good 

-Sensitivity is based primarily on ROC {receiver operating characteristic) curves of spike-in mRNA data based on published reports (see mtp^/^^.bkxx^dcr.cjra} 21 - 21 . 
'Spedficfty measurements are based both on expectations from mismatch weights and published observations In experimental data sets™". 



Affymetrix has announced plans to con- 
tinue to improve the software components of 
the GencChip pIatform.The upcoming release 
of the GeneChip Operating System (GCOS) 
is expected to incorporate refinements in the 
user interface, data management and analysis 
algorithms. Software tools aside, the most sig- 
nificant development on the analysis front is 
arguably the decision by Affymetrix to release 
previously proprietary chip-design details* 
such as probe sequences, chip-design parame- 
ters and file APIs (applications programming 
interfaces). The goal is to encourage scientists 
to develop innovative analysis tools that can 
potentially derive more biological value from 
GeneChip expression data. The challenge of 
providing a constantly growing and evolving 
body of information associated with arrays 
has been solved in part with a web-based 
tool. The company's NETAFFX web site (see 
online links box) serves as the public portal 
for detailed information on chip design and 
has become a valuable resource for biological 
follow-up of GeneChip expression results. 
Third-party software developers can find 
additional support, including information on 
file APIs, through the Affymetrix Developers* 
Network (see online links box). 

Encouraged in part by the openness of the 
platform and spawned by an increase in 
knowledge and experience in array data 
analysis, scientists are developing a number of 
alternative algorithms for probe-set analysis, 
with the goal to derive the best signal that is 
representative of the mRNA level for each 
gene. As each signal is relative to other signals 
in the experiment (both between arrays for 
the same gene and relative to all other genes 
on the array), the process of normalization is 
intimately tied to derivation of signal. The 
more commonly used alternative probe-set 
analysis algorithms include dCHIP 20 , RMA" 
and ProbeProfiler (table t). 

It is outside the scope of this artidc to dis- 
cuss the nature of the different probe-set 
interpretation and normalization algorithms 
in depth, and the reader is referred elsewhere 0 . 



The algorithms differ in a number of impor- 
tant ways (table i). First, the penalty weicht 
that is assigned to the mismatch probe varies 
— MAS 5.0 assigns a relatively heavy penalty 
for cross-hybridization to the mismatch 
probe, RMA assigns no weight and dCHIP 
gives the choice of providing weight or no 
weight Second, the ability to discard outlier 
signals varies from package to package, with 
dCHIP and ProbeProfiler having refined 
methods to detect outliers at each level of 
analysis (probe, probe set and microarray). 
These packages are able to replace deviant 
probes with expected data based on the 
remainder of the probe set, and/or flag abnor- 
mal probe sets and arrays for possible exclu- 
sion from further data analysis. Third, the 
method of normalization varies from within a 



. .robust feature selection 
for the purpose of diagnosis 
and molecular markers in 
clinical trials requires robust 
statistical methods. . . n 



single array (MAS 5.0) to a project-based nor- 
malization (dCHIP, RMA and ProbeProfiler). 
Finally, MAS 5.0 provides a detection p value, 
in which a number is assigned to the confi- 
dence of the signal in question. This can be 
used to weight different probe-set signals in 
subsequent data interpretations. 

The output of all packages is a normalized 
signal (with or without an associated detec- 
tion p value) for each probe set on each array. 
These signals arc then fed into data interpre- 
tation packages for statistical analyses and 
data visualizations. 

Different probe-set interpretation algo- 
rithms lead to different results. Members of 
the Working Group often encounter ~50% 



concordance in general data output in their 
own work between comparisons of two dif- 
ferent algorithms. However, it is crucial to 
note that the large majority of discordant data 
lies in regions of relatively poor signal/noise 
ratios, and concordance deteriorates in exper- 
iments with high levels of confounding noise. 
In general, the programmes that put less 
weight on the mismatch show better sensitiv- 
ity (linearity) when signals are noisy (TABLE l). 
However, mis increased sensitivity can come 
at a cost of substantial contaminating noise 17 . 

The Working Group recommends using at 
least two probe-set algorithms for compari- 
son and prioritization of gene selection (for 
example, MAS 5.0 and the dCHl P difference 
model). 

Data interpretation. Most published microar- 
ray papers could be considered data-poor in 
terms of replicates and systematic statistical 
analyses, but data-rich both in terms of 
amount of high-quality data generated and 
significant research findings. Below, we point 
out the most appropriate current bioinfor- 
marics methods and additional methods that 
require further development so that data can 
be more fully mined for information content 

A second general backdrop to the follow- 
ing discussion is that data visualization is 
one of the most powerful data interpretation 
tools, yet it rarely obeys statistical principles. 
The resolution of the human eye, coupled 
with the abstract computational power of the 
human brain, lies behind the popularity of 
hierarchical clustering and other non-statistical 
principles and visualization methods. How- 
ever, the eye and brain are poorly suited to 
spontaneously deriving statistical support 

There are two general types of experimen- 
tal design that lend themselves to different 
types of statistical and visual analysis: the 
cross-sectional study and the time-sejues study. 
The cross-sectional study typically has gene or 
pattern selection as the goal: the identification 
of one or more genes or patterns of expression 
that arc diagnostic of the condition or state 
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under stud/. This 'gene selection' might be for 
truly diagnostic purposes (for example, differ- 
ential diagnosis of leukaemia), or might be 
intended to identify relevant biochemical 
pathways. In both cases, the gene or pattern 
selection must be robust, usually implying a 
statistically principled approach, with subse- 
quent validation by predictive computer mod- 
elling (internal cross-validation) or, preferably, 
prospective validation on new data- 
Feature selection can be the main limiting 
factor in evaluation of the predictive perfor- 
mance of an analysts method when there are 
many predictors to select from. This was a 
'mantra' for some of the senior statisticians 
involved in predictive modelling with gene- 
expression array data for several years, but 
only now do the non-statistical users and 
developers of predictive models from non- 
statistical perspectives begin to appreciate 
these issues. Proper validation of any model or 
algorithm that relies on explicit feature selec- 
tion — such as choosing a subset of 70 genes 
from 20,000 — that underlies the resulting 



prediction simply must ensure that the analy- 
sis is tested by internal cross-validation that 
includes feature rc-sdection as part of the vali- 
dation 23,2 *. The Working Group acknowl- 
edged that prospective validation of any 
findings using new data is the acid test of 
predictive performance. The focus on fea- 
ture or gene selection is vitally important 
when microarrays are used for differential 
diagnosis and has been best studied in cancer 
biopsy/tissue studies. 

An increasing proportion of micro array 
studies focus on delineation of biochemical 
pathways that are modulated in response 
to some stimulus. In practice, these studies 
typically use feature selection to identify 
potential pathways that arc involved in the 
response of the cells or tissues. Validation 
is then done on the identified biochemical 
pathways of interest, using mRNA (real-time 
PCR) or protein studies, often proving cause 
and effect in experimental models. 

The Working Group notes that robust 
feature selection for the purpose of diagnosis 



and molecular markers in clinical trials 
requires robust statistical methods, as out- 
lined below, and the burden of proof lies 
with statistical validation. For microarray 
experiments designed to delineate biochem- 
ical pathways, feature selection is used for 
generating a hypothesis and the burden of 
proof of the hypothesis lies with laboratory- 
based research, often at the protein leveL 

For feature selection, the Working Group 
recommends that users experiment with 
various statistical methods (such as standard 
parametric tests, nonparametric methods, 
false discovery rate and related methods 25 , 
global or local shrinkage of raw signal inten- 
sities and Stanford's 'nearest shrunken cen- 
troidV"). Developments related to survival 
data analysis arc receiving increased attention 
because clinical trials will raise the need to 
move that way. As a corollary, analysis meth- 
ods that focus on signatures of groups of 
genes (such as averages of clusters, Duke's 
metagenes 27 - 29 and Stanford's eigengenes 36 ) 
seem worth stressing in predictive contexts. 



Glossary 

A-.B- AND C-SEJUES ARRAYS 
A series of human, rat and mouse Affymetrix arrays 
released in 2003, in which the A array contained the 
best-characterized genes, and B and C arrays contained 
less well-defined expressed sequence tags. In 2004. all 
probe sets have been condensed so that there is only 
one microarray per species that covers the entire 
genome. 

CROSS-SECTIONAL DESIGN 

The use of different subjects in an experimental and 
control group or groups. The statistical analysis 
compares the median and variation within each 
group relative to ihe other groups. 

FEATURE 

Typically one dement (spot) on a microarray. 
In spotted cDNA or oligonucleotide arrays, 
features correspond to genes or transcripts; in 
Affymetrix arrays, there are typically 
22 elements per probe set and often multiple 
probe sets per gene, so a feature might refer 
to a single oligonucleotide, a probe pair or 
a probe set, or a gene with multiple probe 
sets. In bio informatics it is most often 
synonymous with a gene. 

aUORJESCENCE-ACTIVATED CELL SORTING 
(FACS). A method thereby dissociated and individual 
living cells are sorted, in a liquid stream, according to the 
intensity of fluorescence that they emit as they pass 
through a laser beam. 

FLUOROPHORE 

A small molecule, or a pan of a larger molecule, 
that can be excited by light to emit fluorescence. 

ISCHAEMIA 

The loss Of blood supply, and hence oxygenation, to a 
tissue or cells. 



LASER CAPTURE MICRODISSECTION 
A technique in which individual cells* or regions of tissue, 
are excised from a histological preparation, using specialty 
equipped microscopes, and isolated for further study. 

LONGITUDINAL DESIGN 

The use of multiple samples from the same subject. With 
this design, each subject serves as their own control, 
eliminating confounding inter-individual variations at 
baseline; paired (-tests are used to interpret the data. 

NEGATIVE CELL ISOLATION 

The use of antibodies or other reagents to remove all 
unwanted cells from a mixed population of cells. In this 
method, die desired ceils are not exposed to bound 
antibodies, thereby avoiding potential activatioo or other 
molecular alteration in the desired ceils. 

PENALTY WEIGHT 

in AfTymct rtx arrays, hybridisation to the 'mismatch* 
probe of a probe pair might or might not be considered 
as a form of measurement of anise or background, and 
can be factored into the signal seen with the paired 
'perfect match' as a penally weight. 

PHOTOUTHOGRAPHY 

The process of using light to either etch or activate 
regions of a surface (substrate). This method is used in 
microelectronics to create integrated circuits and 
processors. 

REAL-TIME PCR 

The quantification of the amount of PCR product 
during each cycle of a PCR reaction. The product 
concentration, as a function of cycle number, provides 
a good estimation of the relative quantity of the 
mRNA being tested. 

RESECTION 

Surgical removal of tissue, most commonly used for 
removing tumorous masses from surrounding tissue. 



S l NUCLEASE PROTECTION 
An experimental method for determining mRNA 
transcript concentration in a tissue or cell RNA sample. It 
involves using labelled DMA probes thai bind the RNA* 
with overhanging nbn-hybridized taib of the probe then 
being digested by the S! nuclease. This creates a smaller 
labelled DMA probe thai is indicative of the abundance of 
the mRNA being tested. 

SURVIVAL DATA ANALYSIS 

A battery of statistical methods applied to data when 
mortality Is often the only, or best, measured outcome. 

TIME-SERIES STUDY 

The use of a series of samples taken at defined time 
points after a defined stimulus. In mice and rati, the 
samples at different time points are usually from 
different animals. In humans, time-series studies are 
necessarily longitudinal to avoid additional confounding 
noise. 

TUKEY-5 BI-WEIGHT ESTIMATOR 
Many statistical tests require underlying definitions 
that are assumed to be valid (for example, tumour 
versus non-tumour), and require data that show a 
normal distribution. Microarray data, and the 
clinical information underlying the definition of 
samples, is often less exact, with genes or samples 
often performing as statistical outliers. Tukey's bi- 
weight estimator is one of the M-cJas* of statistical 
models that is less sensitive to outliers and performs 
more gracefully when underlying assumptions are 
inexact. 

WlLCnXON-S SICKED RANK 
A statistical test that investigates the population 
median of paired differences. It is wcil suited for 
microarray work as it treats each gene as an 
independent variable and does not require normal 
distributions of the data. 
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Rgure 2 [ Dense time-series data with adequate repOcates can provide robust visual 
Interpretation of data. Dynamic, single-gene queries can provide visually compelling results 
and avoid many Issues that compScate statistical analyses of cross-sectional mi croon-ay studies. 
Dynamic queries ol the got 1 transcript probe set from a time-series study Is shown. Th ex- axis represents 
time fin hours) after rats were given a bolus of 3-methytprecWsdorie. They-axJs represents relative 
expression level. InrJvWual rats were studied at each time point (green data points), with both liver (panel a) 
and muscle (panel b) tissue taken from the same animals, Averages d the recreates fror each data point are 
shown {in magenta); the graph finels drawn between the averages. A3 baseline (time 0). the got? transcript 
has a normalized signal of about 2,400 in Over and 1,200 in muscle. The gene is dearly responsive to 
3-nTethyIpreoVijsdone in liver (panel a), where expression rapidly increases within the first hour, plateaus 
between 1 -7 hr. then quickly falls back to basefine by 1 2 hr. The recreates appear relatively consistent 
By contrast, the same gene in muscle does not seem to respond to the drug; the variabifity in repCcates 
is larger than any temporally-relevant change. Data from hitpy/rricroam3y.cr™ 
singJegenemaiaasp and Rf.FS 632. 



Whatever the specific statistical model that is 
applied for prediction using aggregate gene 
expression has important consequences: 
measures of aggregation of expression over a 
group of genes with related profiles can 
reduce dimension (thereby mitigating the 



curse of dimensionality). This can reduce 
multiplicities and, to some degree, ease the 
problems of gene selection, multiple testing 
and co -linearity, while improving signal 
estimation by averaging correlated noise 
components. 



Data visualizations, time series and candidate 
gates. The above discussions of biostatistics 
all assume that the analysis is targeted towards 
a cross-sectional study, in which the primary 
goal is diagnostic gene discovery (gene or fea- 
ture selection). In other words, a series of 
rnicroarrays with a very large number of tran- 
scripts defines the very small minority of 
genes that axe correlated and therefore predic- 
tive of the biological variable of interest. 
There are alternatives to this standard experi- 
mental design that use entirely different types 
of analysis, and the statistical issues are also 
quite different, as explained below. 

The time-series study, if done with enough 
time points, can provide an effective antidote 
to the curse of dimensionality — the action of 
any gene during a time-series study should 
make biological sense, such that each signal is 
relatively easily discernible from noise. Visual 
query of a large time-series data set for single 
gene responses to the controlled variable cither 
might meet expectations and is therefore valid, 
or might not meet expectations and is dis- 
carded as uninteresting. As an example, we 
show a time-series study in which rats are given 
a bolus of methyiprednisolone. after which 
their liver and muscle are studied as a function 
of time (fig. 2). In this case, the same gene (got J) 
is queried using a web-based dynamic visual- 
ization tool, first in liver (FIG. 2a) and then in 
muscle (fig. 2b). The data in the top panel are 
visually compelling; gotl in liver responds 
quickly and strongly to a bolus of 3- methyi- 
prednisolone, with relatively consistent repli- 
cates (each data point comes from a different 
animal) and a time course that is visually assur- 
ing so that complex statistical tests of the tran- 
scriptional response as a function of time are 
not needed. On the other hand, the same gene 
in muscle does not seem to respond to the 
drug 631 (FIG. 2b). Through such gene queries, 
the variability in replicates and the appropri- 
ateness of the action of the gene as a function 
of time can quickly be assessed. Another 
advantage of time-series data is that such pro- 
files act as biomarkers that are amenable to 
analysis and interpretation using pharmacody- 
namic models that predict the underlying 
mechanisms of control of gene expression n . 

The Working Group agreed that data-rich, 
time-series experimental designs provide 
some latitude in reporting significant findings 
and that the query of individual genes within 
large data sets can circumvent complex issues 
of multidinicnsionality of data. 

Future areas of development 

The data-rich and highly dimensional nature of 
microarray data serves as a model for future 
dissection and understanding of biological 
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systems in general, including proteomics and 
integration of mRNA profiling and proteomics. 
The Working Group discussed data analysis 
needs within the microarray community and 
agreed that, along with the incorporation of 
QC SOPs and optimized or customized sig- 
nal/noise analyses in initial project signal gener- 
ation, the back-end statistics needs to reach a 
commonly accepted method of dealing with the 
curse of dimensionality before rnicroarrays can 
be reliably used in clinical trials. Statisticians 
need to focus more on representation of pre- 
diction results in terms of probabilities and 
associated measures of uncertainty, and reach a 
consensus on what is acceptable. In the mean- 
time, it is likely that specific marker or diagnos- 
tic genes will be extracted from pilot profiling 
studies, and then only this small subset of genes 
will be used as a clinical trial endpoint. This data 
limitation approach removes much of the curse 
of dimensionality, but is liable to ignore the 
large majority of data, thereby decreasing the 
potential power of the study and bringing into 
question the use of rnicroarrays in clinical trials. 

A move towards the standardization of 
reporting of prediction accuracy would be 
desirable when assessing predictive accuracy 
through within-sample cross-validation. The 
Working Group suggests that one or more val- 
idation techniques be used when reporting 
predictive genes: leave-one-out and 10% 
cross-validation summaries, or true validation 
data sets. Communicating uncertainty about 
predictive performance is also key and will 
help evaluate results based on varying sample 
sizes. The Working Group suggests that until 
this information is routinely presented in pub- 
lished papers, it will be difficult to reach an 
acceptable consensus for use in clinical trials. 

Conclusions 

There are four key areas of optimization and 
standardization that are largely independent: 
study design, technical variability (QC/SOP of 
data generation), analysis method variation 
(signal/noise optimization using probeset 
algorithms and normalizations) and back-end 
statistical analyses. Statistics of clinical trial 
design is crucial: gene-expression data does 
not mitigate the need for sound and relevant 
design and analysis, nor does it challenge what 
we know about design. The field is quickly 
maturing from the small-chip-number hit- 
and-run type projects to those with a more 
robust study design. However, study design 
depends ultimately on appropriate powering 
of a study, which is greatly affected by both 
the chip-analysis algorithms that are used and 
the biostatistical data analysis. 

Development of back-end statistical meth- 
ods for data representation/summary and for 



high-level analysis remains an active area of 
research for both academic and commercial 
users, and is likely to remain so in the near 
future. We are some way from defining stan- 
dards of summary signal intensities alone 
and even further from considerations of 
standardization of analytical methods for 
inference and prediction in clinical contexts. 
In regulated clinical studies, such standards 
will be enforced partly by the US FDA as sub- 
missions of medical test/device protocols 
emerge and increase in number. Even then, 
however, many approaches to data analysis 
and modelling will be used and developed, 
which is, of course, to be supported. It is very 
difficult to influence the research commu- 
nity, especially when the variety of problems 
that are encountered promotes the need for 
refined and new approaches. 
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FROM DNATO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 



Figure 6-3 Genes can be expressed 
with different efficiencies. Gene A 1$ 
transcribed and translated much more 
efficiently than gene B.This allows the 
amount of protein A In the cell to be 
much greater than that of protein B. 



Portions of DNA Sequence Are Transcribed into RNA 

Hie first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene—into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA — 
the language of a nucleotide sequence. Hence the name transcription. 

like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-4). It differs from 
DNA chemically in two* respects: (1) the nucleotides in RNA are 
ribonucleotides— that is, they contain the sugar ribose Chance the name ribonu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine CA), guanine (G), and cytosine (Q, it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like X can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA On RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single-stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form fixe final shape of 
a protein (Figure 6-6) . As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. (A) Schematic Illustration of the type of 
conformational change in a protein that produces material for a cross-beta filament. (B) Diagram illustrating 
the self-Infectious nature of the protein aggregation that It central to prion diseases. PrP Is highly unusual 
because the mlsfolded Version of the protein, called PrP*, induces the normal PrP protein it contacts- to 
change its conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure Is 
not Infectious In this way, it cannot spread from one animal to another. (Q Drawing of a cross-beta filament, 
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases. 
Because the hydrogen-bond interactions In a 0 sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP*, showing the likely change of two ohelices into four 
p-s$ands.A! though the structure of the normal protein has been determined accurately, the structure of the 
infectious form is not yet known win) certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serp atU adapted from M Sunde et at J. Mo/. Bfoi 273:72^739, 
1 997; D, adapted from &B. Prusiner, Trends Btocnem. Set Zi ;482-487, 1 996.) 



animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by fee spread of BSE (commonly referred 
to as the "mad cow disease") from cattle to humans in Great Britain, 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfbld 
into an infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance'' observed 



There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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ure 6-90) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the Initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step—the transcription of its DNA sequence into an UNA 
molecule. 



Figure 6-90 The production of a 
protein by a eucaryotie celLThe final 
level of each protein In a eucaryotie ceil 
depends upon the efficiency of each step 
depicted 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribon ucleopro tein assembly called a ribosome. The 
amino adds used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair Interactions, par* 
. tlcularsets of three nucleotides in the mRNA (codons). The sequence ofnucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

TO initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs — each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anticodoru Each amino acid is added to the 
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7S Six steps at which 
euearyotxc gene expression can be 
controlled. Controls that operate at 
steps I through 5 are discussed i'd .this 
chapter. Step 6, die regulation of protein 
activity, Includes reversible activation or 
Inactivation by protein phosphoryfadori 
(discussed In Chapter 3) as wefl as 
Irreversible inactivation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 
If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leadmg from DNA to protein, and an of them can in principle be regulated Thus 
a cell can control the proteins it mates by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
txanscnpt is spliced or otherwise processed (RNA processing control}, (3) 
selecting which completed mRNAs mmecefl nucleus are exported to the cytosol 
and determiriing where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
byribosomes (translatioiial control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNAdegradatlon control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5) 

For most genes transcriptional controls are paramount This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 

» 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
°£&™z they express in response to changes in their environmen t, such as signals 
from other cells. Although all of the steps involved In expressing a gene can in prin- 
ciple be regulated, for most genes the initiation of RNA transcription is the most 
Mportant point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

; How does a cell determine which of its thousands of genes to transcribe? As 
I memloned briefly in Chapters 4 and 6, the trar* crir>tion of each gene is con- 
^ w o areguktoryreglon ofDNA datively near the site where transcription 
r- oeguis. Some regulatory regions are simple and act as switches that are thrown 
goy a single signal. Many others are complex and act as tiny microprocessors, 
fcSfS^ to a of Agnate that they interpret and integrate to switch the 
|neigribormg gene on or off. Whether complex or simple, these switching devices 
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occur In the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive . 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters of genes thai remain active 
in the germ cell lineages * (including most housekeeping genes) are kept 
unmethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired Such regions are preserved in modern day 
vertebrate cells as CG islands* hi addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against and some CG islands are simply the result of a higher than nor* 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 

Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 



Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a cell between two. alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of celt in ah igher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, Inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also Junctions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 
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Figure 7-86 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands In .vertebrate 
genomes. A black line marks the location 
of a CG dinu'deotf de In the DNA 
sequence* while a red "lollipop" indicates 
the presence of a methyl group on the 
CG dlnudeotide. CG sequences that lie In 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained In 
evolution. Methylated CG sequences, on 
the other hand, tend to be lost through 
deamination of 5 -methyl C to T, unless the 
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these ptottranscriptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional controlt for 
many genes they are crucial 
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Correlation of amplification and overexpression of the c-myc 
oncogene in high-grade breast cancer: FISH, in situ hybridisation 
and immunohistochemical analyses 
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In this study, we analysed gene amplification, RNA expression and protein expression of the c-myc gene on archival tissue specimens 
of high-grade human breast cancer, using fluorescent in situ hybridisation (FISH), nonradioactive in situ hybridisation and 
immunohistochernistry. The specific question that we addressed was whether expression of c-Myc mRNA and protein were 
correlated with its gene copy amplification, as determined by FISH. Although c-Myc is one of the most commonly amplified 
oncogenes in human breast cancer, few studies have utilised in situ approaches to directly analyse the gene copy amplification. RNA 
transcription and protein expression on human breast tumour tissue sections. We now report that by using the sensitive FISH 
technique, a high proportion (70%) of high-grade breast carcinoma were amplified for the c-myc gene, irrespective of status of the 
oestrogen receptor However, the level of amplification was low. ranging between one and four copies of gene gains, and the majority 
(84%) of the cases with this gene amplification gained only one to two copies. Approximately 92% of the cases were positive for c- 
myc RNA transcription, and essentially all demonstrated c-myc protein expression. In fad a wide range of expression levels were 
detected. Statistically significant correlations were identified among the gene amplification indices, the RNA expression scores and 
protein expression scores, c-myc gene amplification, as detected by FISH, was significantly associated with expression of its mRNA. as 
measured by the intensity of in situ hybridisation in invasive cells (P = 0.0067). and by the percentage of invasive cells positive for 
mRNA expression (P = 0,0006). c-myc gene amplification was also correlated with the percentage of tumour cells which expressed 
high levels of its protein, as detected by immunohistochernistry in invasive cells (P = 0.001 6). Thus, although multiple mechanisms are 
known to regulate normal and abenrent expression of c-myc, in this study, where in situ methodologies were used to evaluate high- 
grade human breast cancers, gene amplification of c-myc appears to play a key role in regulating expression of its mRNA and protein. 
British journal of Cancer (2004) 90, 1612- 1619. doi: 1 0. 1 038/sj.bjc.66O 1 703 www.bjcancer.com 
Published online 30 March 2004 
© 2004 Cancer Research UK 
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The c-myc oncogene has been shown to be amplified and/or 
overexpressed in many types of human cancer (Marcu et ah 1992; 
Nass and Dickson, 1997; Nesbit et ah 1999; Liao and Dickson, 
2000). Numerous experiments in vivo have also causally linked 
aberrant expression of this gene to the development and 
progression of cancer in different body sites (Marcu et ah 1992; 
Nass and Dickson, 1997; Nesbit et ah 1999; Liao and Dickson, 
2000). However, several critical issues regarding the significance of 
c-myc in human cancer still remain obscure. First, even for a given 
type of malignancy, the frequencies of the alterations of c-myc at 
the cytogenetic and expression levels vary greatly from one report 
to another (Liao and Dickson, 2000). For instance, the frequencies 
of its amplification, mRNA and protein overexpression in breast 
cancer vary between 1-94, 22-95 and roughly 50-100%, 
respectively, among different reports (Liao and Dickson, 2000). 

^Correspondence: Dr J Blancato; E-mail: blancatj@georgetown.edu 
Received 7 July 2003; revised 1 7 January 2004; accepted 1 7 January 2004; 
published online 30 March 2004 



Thus, it is still unclear to what extent this gene is altered at the 
cytogenetic level and at different expression levels in breast 
carcinoma, 

One controversial issue pertains to the prognostic value of c-myc 
gene alterations in cancer. The central role of c-Myc protein in 
accelerating cell proliferation, documented by many early studies, 
has led to a general concept for many types of cancer that 
amplification or overexpression of this gene may be associated 
with a more aggressive tumour and a poorer patient survival 
(Berns et ah 1992; Marcu et ah 1992; Sato et ah 1995; Nass and 
Dickson, 1997; Nesbit et ah 1999; Visca et ah 1999; Liao and 
Dickson, 2000). However, many reports have shown an opposite 
correlation (Sikora et ah 1985, 1987; Watson et ah 1986; Polacarz 
et ah 1989; Voravud et ah 1989; Williams et ah 1990; Melhem et ah 
1992; Pietilainen et ah 1995; Diebold et ah 1996; Smith and Goh» 
1996; Augenlich et ah 1997; Bieche et ah 1999), while other studies 
do not support either of these conclusions. For instance, gene 
amplification or overexpression of c-Myc protein has also been 
shown to associate with a better tumour differentiation or a better 
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patient survival for cancer of the testis, ovary, bile ducts, colon and 
breast (Sikora et al, 1985, 1987; Watson et al, 1986; Polacarz et al, 
1989; Voravud et al, 1989; Williams et al, 1990; Melhem et al, 1992; 
Pietilainen et al, 1995; Diebold et al, 1996; Smith and Goh, 1996; 
Augenlich et al, 1997; Bieche et al, 1999). This controversy does 
not appear to be related completely to the cancer type, since both 
positive (Berns et al, 1992; Visca et at, 1999) and negative 
(Williams et al, 1990; Melhem et al, 1992; Pietilainen et al, 1995; 
Smith and Goh, 1996; Augenlich et al, 1997; Bieche et al, 1999) 
correlations have been reported for colon cancer and breast 
cancer. More interestingly, c-Myc overexpression has been shown 
to predict a poorer prognosis for cutaneous melanoma, but a 
favourable outcome for uveal melanoma (Grover et al, 1997; Chana 
et al, 1998a, b, 1999; Grover et al, 1999). These data indicate 
different roles of c-Myc> even in the same type of tumour, perhaps 
depending upon different tissue microenvironments. 

Another controversial issue concerns the nuclear -cytoplasmic 
localisation of c-Myc. Studies of neoplasms of the colon, testis, 
ovary and liver have shown that predominandy nuclear localisa- 
tion of c-Myc tends to occur in benign lesions, while cytoplasmic 
localisation tends to occur in more malignant tumours (Sikora 
et al, 1985; Sundaresan et al, 1987; Melhem et al, 1992; Sasano et al, 
1992; Yuen et al, 2001). Whether these patterns of subcellular 
localisation of c-Myc tend to reflect the malignant status of breast 
cancer remains an enigma. 

A recent study of the impact of DNA amplification on gene 
expression patterns in breast cancer used mRNA and DNA from 14 
breast cancer cell lines. Analysis was conducted with a 13000 
cDNA clone array for gene expression measurement and a 
Comparative Genomic Hybridisation (CGH) microarray for gene 
copy number measurements. This study also included known 
breast cancer genes, such as c-myc, HER/2-neu and aibl (Hyman 
et al, 2002). Interestingly, 44% of the most highly amplified genes 
were also overexpressed at the mRNA level. Consistent with this 
pattern, c-Myc gene copy number and its expression levels showed 
a statistically significant (or = 0.020) correlation in this microarray 
study of breast cancer cell lines. Another study, by Pollack and 
colleagues, used microarray analysis and BAC array CGH of RNA 
and DNA (respectively) extracted from intermediate grade human 
breast tissues, and tested for amplification and expression of c- 
Myc (among other genes). This study demonstrated that two out of 
37 specimens were both amplified and overexpressed, while others 
were either amplified or overexpressed, but not both. The authors 
of this study suggested that contaminating stromal tissue may 
compress the fluorescence ratios leading to underestimates of gene 
amplification and overexpression (Pollack et al, 2002), 

To more clearly address the importance of gene amplification 
and expression of c-Myc in human breast cancer, we used in situ 
methodologies, which can clearly distinguish stromal and carci- 
noma components. We studied the amplification and over 
expression of the c-myc gene with fluorescent in situ hybridisation 
(FISH), non-radioactive in situ hybridisation (ISH) and immuno- 
histochemical (IHC) approaches on paraffin-embedded biopsy 
sections of untreated, high-grade breast cancer. It was observed 
that 70, 92 and 70% of the cancer cases exhibited c-myc gene 
amplification, its mRNA overexpression and its protein over 
expression, respectively. In most of the cases (84%) that showed 
gene amplification, the c-myc gene gained only one to two copies, 
which is consistent with c-myc FISH data from other studies. 
Unlike some oncogenes, such as N-myc, which typically demon- 
strates gene amplification copy numbers of greater then 10 in 
neuroblastoma, and HER-2/neu (Sartelet et al, 2002), whose copy 
numbers range up to 14-40 in breast carcinomas (Isola et al, 
1999), gene copy numbers of c-myc are not as greatly increased. In 
the study noted earlier, using breast cancer cell line CGH array and 
cDNA microarray expression analysis, it was demonstrated that 
the most dramatically increased expression levels were associated 
with large gene copy number increases, although low-level gains 



and losses had a significant influence on gene expression 
dysregulation (Hyman et al, 2002), Only one study has been 
published (Pollack et al, 2002) that has begun to determine if these 
findings are direcdy relevant to actual human breast tumour 
tissues, since many of the genetic changes in tissue culture cell 
lines are more extreme than those displayed in primary tumour 
material. Furthermore, the relationships among gene amplifica- 
tion, mRNA expression and c-Myc protein expression were not 
explored in prior human breast cancer cell line and tumour tissue 
studies (Hyman et al, 2002; Pollack et al, 2002). 

In our human breast tumour tissue study, a high correlation was 
found between c-myc FISH and ISH, for both percentage of 
staining (P< 0.0067) and intensity positive cells (P<0.0006). In 
addition, c-myc gene copy amplification by FISH was correlated 
with c-Myc protein expression positive cells by IHC (P< 0.0016). 
These results support the idea that c-Myc overexpression of both 
mRNA and protein is related to the copy number of the c-myc 
DNA amplification. We show in this study that amplification and 
overexpression of c-Myc occur with high frequency in high-grade 
human breast cancer tissues. 



MATERIALS AND METHODS 
Materials 

Formalin- fixed, paraffin-embedded tissue blocks of breast carci- 
noma and normal breast tissue were obtained from the 
His to pathology and Tissue Shared Resource at the Lombard i 
Comprehensive Cancer Center (LCCC), at Georgetown University 
Medical Center. The criteria for tumour selection were the 
following: negative progesterone receptor status, metastases to 
auxiliary lymph nodes and high grade (Elston Score >7). The 
oestrogen receptor status of the tumours was known from archived 
pathology reports. The parameters were chosen from our prior 
meta-analysis (Deming et al, 2000), as indications of a high 
likelihood of c-myc gene amplification. Normal breast tissue 
specimens were from reduction mammoplasty. Serial sections 
(5/tm) for FISH, ISH and IHC were prepared by the LCCC 
Histopathology and Tissue Shared Resource. 

FISH 

A dual-label FISH technique was used (Jenkins et al, 1997). Slides 
were baked overnight at 60 P C to assure adherence of the sample. 
Tissue sections were deparaffinised with two successive, lOmin 
xylene washes, and then dehydrated in a graded ethanol series of 
70, 80 and 95% at room temperature. Samples were then digested 
with 4% pepsin (Sigma, St Louis, MO, USA) at 45°C for lOmin. 
DNA probes used were an alpha satellite probe to chromosome 8, 
labelled with biotin, and a c-myc probe, labelled with digoxigenin 
(Ventana, Tucson, AZ, USA). Codenaturation was performed at 
90° for lOmin on a hot plate. Hybridisation was at 37°C for 12- 
16 h. Detection of signals was accomplished with an antiavidin 
antibody labelled with Texas Red, and an antidigoxigenin antibody 
conjugated to fluorescein (Ventana, Tucson, AZ, USA). Slides were 
postwashed in 2 x SSC at 72°C for 5 min and counterstained with 
DAPI to visualise cell nuclei. Results were viewed and quantified 
with a Zeiss Axiophot fluorescence microscope, equipped with 
appropriate filters and an Applied Imaging Cytovision system 
(Pittsburgh, PA, USA). In this approach, the c-myc unique 
sequence probe was visualised as a green signal and the control 
probe for the chromosome 8 centromere was red, thus easily being 
distinguished when scored. 

One serial section from each tumour sample was stained with 
haematoxylin and eosin and first reviewed by a pathologist (BS), to 
help identify the tumour area of the section. This procedure 
ensured that the tumour cells, but not the normal cells, were 
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counted. Nuclei of up to 50 tumour cells were scored from each 
FISH-stained section, independently by two investigators. Hybri- 
disation signals were averaged, and the amplification index was 
presented as the number of c-myc signals divided by the number of 
chromosome 8 centromere signals. A 1.8-fold increase was used as 
the criterion to judge the presence of c-myc gene amplification. 

In situ hybridisation 

In situ hybridisation (ISH) was carried out with a nonradioactive 
method, described previously (Liao et aU 2000a, b). One serial 
section from each specimen was hybridised overnight at 60°C with 
riboprobes, that were in vitro transcribed from the antisense or 
sense strand of an approximately 300 bp cDNA of human c-myc 
(ATCC, Manassas, VA, USA), labelled with digoxigenin-conjugated 
UTP. The sections were then incubated with an antibody against 
digoxigenin, followed by incubation with a second antibody 
conjugated to alkaline phosphatase. The signal was visualised by 
colour development with 5 bromo-4-chloro-3-indolyI phosphate 
and nitroblue tetrazolium. All reagents were purchased from 
Boehringer Mannheim, Indianapolis, IA. To control the signal 
specificity, two serial sections were mounted on the same slide for 
hybridisation with the antisense and sense probes, respectively. 
ISH was given an intensity and percentage scores, based on 
intensity of positive staining and number of cells staining, 
respectively. Intensity scores were assigned 0, 1, 2 and 3, and 
percentage scores were assigned as 1- 1-25, 2- 26-50, 3- 51-75 
and 4- 76-100%. 

Immunohistochemistry 

Immunohistochemical staining (IHC) was performed using an 
avidin-biotin complex (ABC) method described previously (Liao 
et aU 1998). One serial section of each specimen was deparaffinised 
and blocked with 3% peroxide. Antigens were retrieved by heating 
slides in a microwave oven in 50 mw citrate buffer, pH 6.4, at 
boiling temperature, for 12min. After blocking with 6% normal 
goat serum, the section was incubated with a mouse monoclonal 
antibody to human c-Myc (9E10, Sigma Chemical Company, St 
Louis, MO, USA) at 1:100 dilution for 2h, followed by ih 
incubation with a second antibody conjugated with biotin (Vector 
Laboratories Inc., Burlingame, CA> USA). The section was then 
incubated with peroxidase-conjugated avidin (Dako, Corporation, 
Carpinteria, CA, USA) for 30min, followed by colour development 
with diaminobenzidine and peroxide. All procedures were carried 
out at room temperature. To control the signal specificity, serial 
sections from 10 tumour samples were also stained using an 
alternate c-Myc antibody (CI 9 from Santa Cruz Biotechnology 
Inc., Santa Cruz, CA, USA) at 1 : 60 dilution. This antibody resulted 
in focally positive staining in the tumour, but the staining intensity 
was weaker. To control the signal specificity, serial sections were 
made from five selected positive cases which were subjected to the 
same staining procedure, with a normal mouse IgG to replace the 
c-Myc antibody. This control staining did not give rise to a signal, 
demonstrating the specificity of the c-Myc antibody signal. IHC 
staining was given an intensity and percentage score based upon 
the intensity of positive staining and number of cells staining. 
Intensity scores were assigned 0, 1, 2 and 3 and percentage scores 
were assigned as 1- 1-25, 2- 26-50, 3- 51-75 and 4- 76-100%. 
Determinations were made of cellular localisation of c-Myc 
antibody staining to cytoplasm and/or nucleus in normal and 
invasive cells within each breast tumour specimen. 

Statistical analyses 

For each analysis of gene copy amplification (FISH), mRNA 
expression (ISH) and protein expression (IHC), all cases were first 
grouped as positive or negative to calculate the percentages of 



positive cases and negative cases, as described (Zar, 1974). Fisher's 
exact test was used to compare percentages, and two-sample i-tesi 
or Wilcoxon rank test was used to compare average scores. Both 
ISH and IHC were given intensity and percentage scores, based on 
intensity of positive staining and number of cells staining, 
respectively. As noted earlier, intensity scores were assigned 0, 1, 
2 and 3 and percentage scores were assigned as 1- 1 - 25, 2- 26-50, 
3- 51-75 and 4- 76-100%. A score of >2 for either intensity of 
staining or percentage of cells positive by ISH was assigned as 
high. For IHC, an intensity score of > 1 was assigned as high and a 
percentage score of > 3 was categorised as high. Each amplifica- 
tion index was paired with its corresponding mRNA expression 
score to calculate the coefficient r. The same method was used to 
estimate the association of the amplification indices with the c- 
Myc protein expression levels, and the association of the mRNA 
expression levels with the protein expression levels. A P-value of 
0.05 or less was used to determine the statistical significance in all 
analyses. In all, 54 pairs of normal vs invasive tissues were analysed 
using McNemars x 2 test to determine if there was a difference in 
cellular localisation of c-Myc antibody signal to nuclear or 
cytoplasmic compartments. 



RESULTS 

FISH analysis of gene amplification 

Amplification of the c-myc gene was measured by a FISH test in 46 
cases of breast cancer; Figure 1 demonstrates cells with no 
amplification (one copy of c-myc /one copy of chromosome 8 
centromere, and a moderate amplification a 3/1 ratio). Amplifica- 
tion was calculated by the number of c-myc signals divided by the 
number of chromosome 8 alpha satellite signals. A 1.8-fold 
increase cut-off was used to judge gene amplification. As shown 
in Table 1, 32 out of 46 (70%) cases were gene amplified for c-myc, 
whereas only 30% (14/46) of the cases showed amplification 
indices lower than the cut-off value. The amplification indices for 
most (84%, or 27/32) cases with gene amplification, ranged 
between 1.8- and three- fold, indicating that the locus gained up to 
two copies of c-myc in the majority of the cases. The percentage of 
cases with gene gains of three copies or higher was 1 1% (five out of 
46) of total cases analysed, or near 16% (five out of 32) of the cases 
with gene amplification, including one case (2% of total cases or 
3% of the cases with gene amplification) with the highest index of 5 
(a gain of four copies). 

In all, 28 of the breast carcinomas in this study were ER 
negative, and 14 were ER positive. The average c-myc gene 
amplification score was 1.896 (s.e. = 0.196) for ER positive and 
2.201 (s.e. = 0.157) for ER negative. Although ER-negative tumours 
had a slightly higher average c-myc score, the difference was not 
statistically significant (two-sided P = 0.252 from two-sample Mest 
and 0.251 from Wilcoxon rank test), consistent with the results of 
our prior meta-analysis of the literature (Deming et aU 2000). 



In situ hybridisation analysis of c-myc mRNA expression 

A total of 51 breast cancer samples were studied for c-Myc mRNA 
expression, with non radioactive in situ hybridisation (ISH). ISH 
results were assigned intensity and percentage scores based upon 
signal intensity of positive staining and number of cells staining 
within the sample, respectively. As shown in Table 2, 86% (44 out 
of 51) tumours were scored as high in intensity, and 92% (47out of 
51) had more than 51% positive cells, also considered as highly 
increased c-Myc expression. mRNA expression was heterogeneous 
in the breast tumour tissue, and no morphologic subtype was 
predominant in the high or low categories. One case showed no c- 
Myc ISH staining. In 79% (38/48) of cases, epithelia in normal 
mammary glands adjacent to the tumour also showed a high 
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Figure I FISH analysis of c-myc amplification in tumour cells from breast tumour tissue sections. FISH probe for human c-myc unique- sequence is seen as 
green, while the normal control signal, a centromeric probe signal for chromosome 8 is shown in red. The nuclei of tumour cells were visualised by DAPI 
counter-staining. (A) 1:1 copy ratio of c-myc to chromosome 8 (c-myc/8 centromere), indicating no amplification of c-myc in tumour cells. (B) 1:3 copy 
ratio of c-myc to chromosome 8 (c-myc/8 centromere), a moderate amplification of the c-myc gene. 



Table I c-myc gene copy amplification analysis by RSH in poor 
prognosis human breast tumour samples 



Amplification index 




(#c-myt signal s/# 


Percentage of samples with 


control signals)-*' 


FISH ratios in each category 


1.0-17 


30% 14 out of 46 


1.8-1.99 


20% Nine out of 46 


2.0-2.9 


39% 18 out of 46 


>3.0 


11% Five out of 46 



Analysis was conducted on 46 individual paraffin -embedded tissue samples with 
negative progesterone receptor status, positive lymph node involvement and high 
tumour grade. + Normal control ratio is I. 



intensity of staining. In three cases, no staining was seen in the 
normal terminal duct lobular units. Figure 2 shows representative 
fields of high, medium and low c-myc mRNA expression levels in 
invasive ductal carcinoma samples. 

Association of FISH and ISH 

c-Myc scores were dichotomised as binary variables (high or low), 
and a score of 2 or higher was categorised as high on ISH. A score 
higher than median was categorised as high from FISH studies. 
These dichotomised scores are depicted in Table 3. A Fisher's exact 
test was performed for comparing binary responses to see if there 
was any association between FISH and ISH. It was found that the 
FISH score was significantly associated with percentage of staining 
in the invasive cells (P = 0.0067, two-sided McNemar's test) and 
also with the intensity score on ISH (P = 0.0006, two-sided). 



Immunohistochemical staining of c-Myc proteins 

In total, 51 breast carcinomas, which were subjected to FISH 
analysis, and all of which also had been analysed for c-myc mRNA 
by in situ hybridisation, were also analysed for the expression of c- 
Myc protein, using immunohistochemical staining with the 9E10 
antibody. IHC results were assigned an intensity and percentage 
score based on intensity of positive staining and number of cells 
staining, respectively. Intensity scores were assigned 0, 1, 2 and 3 
and percentage scores were assigned as 0, 1- 0-25, 2- 26-50, 3- 



51-75 and 4- 76-100. For IHC, an intensity score of >1 was 
assigned as high and a percentage score of > 3 was categorised as 
high. Figure 2 shows examples of high, medium and low levels of c- 
myc antibody staining in invasive ductal carcinoma samples. In 34 
cases, normal tissue was seen; 30 of these showed cytoplasmic 
staining and 22 had nuclear staining in terminal ductal lobular 
units. In all, 12 cases showed 1 + , 14 cases 2 + and four cases 3 + 
cytoplasmic staining. In situ hybridisation revealed positive 
staining in 46 out of 49 cases with normal tissue. Seven cases 
showed I-K 13 cases showed 2 4- and 26 cases showed 3 + 
staining by ISH. Both immunohistochemistry and in situ 
hybridisation showed diffuse positivity in adipocytes. 

Table 4 shows the staining pattern for the cohort In all, 70% (36 
out of 51) of cases showed high intensity of staining for c-Myc 
protein, while 85% (29 out of 34) of cases with detectable staining 
had more than 76% positive cells, also considered as high 
expression. To verify the staining specificity, serial sections from 
10 tumour specimens that were positive for 9E10 antibody were 
also stained using the CI 9 rabbit polyclonal anti-c-Myc antibody. 
Results revealed a staining pattern similar to 9E10. However, the 
staining intensity with C19 was weaker than 9E10. The specificity 
of these two antibodies was verified by Western blots in previous 
studies (Persons et al t 1997; Liao et aU 2000b). Figure 2 shows 
results of c-Myc in situ hybridisation and immunohistochemistry 
studies on samples considered to demonstrate low, moderate and 
high levels of c-Myc expression. Analysis of c-Myc protein 
localisation results in the nucleus or cytoplasmic compartments 
of normal and invasive cells within the tumours revealed that 
nuclear staining was positive in 41% of normal cells, compared to 
22% of invasive cells (statistical significance at P=0.01 by 
McNemar's two-sided % 2 test). The increase in relative cytoplasmic 
localisation of c-Myc protein, comparing normal (53.7%), to 
invasive cells (61.1%) was not significandy different. Thus, the data 
are consistent with partial exclusion of c-Myc from the nuclei of 
invasive breast cancer cells. 

The FISH score was significantly associated with the percentage 
positivity of invasive cells, as seen on IHC studies of c-Myc. 
However, 40% of tumours displayed a low index of c-myc gene 
amplification, but still expressed high levels of c-Myc protein 
(Table 6), indicating the possibility of other mechanisms of over 
expression unrelated to gene amplification in at least some 
tumours. The FISH score was not significantly associated with 
the intensity of IHC staining in the invasive cells (not shown), in 
contrast to the IHC percentage positivity score. 
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Table 2 c-myc mRNA in situ hybridisation (ISH) results 



Staining intensity 


0 12 3 Percent posftivlty 12 3 4 


Number of tumour samples in each category N = 5 1 


1 6 25 19 Number of tumour samples in each level category N - 5 1 1 3 5 42 



In all 51 human high-grade breast carcinomas were analysed to determine the relationships between c-Myc mRNA expression and c-myc gene in situ hybridisation results. Data 
are shown in two ways in the above table. First overall staining intensity of c-Myc-poshive cells was scored as 0, I, Z 3 (low to high), and the number of tumour samples at each 
level of staining indicated on the fine below. Next, the percentage of tumour cells staining was scored as 0. I, Z 3. 4 {low to high %, as discussed in Materials and Methods). The 
number of tumours at each level of percent cell positivity for c-Myc is then indicated on the line below. 




Figure 2 Immunohistochemical staining and in situ hybridisation for c-Myc of three sets of invasive ductal carcinoma. (A, C and E) High (3 + ). 
intermediate (2 + ) and low ( 1+ ) level of staining by immunohistochemistry for c-Myc (B, D and F) High (3 + ). intermediate (2 + ) and low ( I 4- ) level of 
staining by in situ hybridisation. 



DISCUSSION 

Although there have been many reports on c-myc amplification in 
human breast cancer (Liao and Dickson, 2000), there are only two 
published studies involving application of the FISH technique to 
unfixed, frozen sections (Persons et aU 1997; Visscher et al t 1997), 
and one prior study using FISH on an archival human tissue 
microarray (Schraml et al, 1999). Another recent study applied 
FISH to evaluate c-myc amplification in ductal carcinoma in situ 



(DCIS) (Aulmann et al t 2002). Using the FISH technique on 
formalin-fixed, paraffin-embedded sections, we now show that 
70% of high-grade breast cancer samples bear c-mye gene copy 
amplifications. Interest! ngly, the above-mentioned study, using 
FISH and focusing on DCIS, detected amplification of c-myc in 
only 20% of cases, but found a correlation of c-myc with increased 
tumour size and proliferation (Aulmann et al, 2002). 

The level of amplification of c-myc in our study ranged between 
one and four additional copies of the gene; the majority (84%) of 
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the cases with the gene amplification gained only one to two 
copies, also consistent with FISH data reported for c-myc copy 
amplification in human metastatic prostate carcinoma tissues 
(Jenkins et ah 1997). The relationship between the level of c-myc 
gene copy amplification and the level its increased mRNA 
expression has been examined previously in breast cancer cell 
lines (Hyman et ah 2002). In general, it has been concluded that 
the two scores coordinate for c-myc, as is the case for many breast 
cancer genes. However, only 44% of the highly amplified genes, in 
general, showed increased RNA expression, and only 10.5% of the 
highly overexpressed genes were gene copy-amplified in the cell 
line study (Hyman et ah 2002). Another analysis was conducted to 
study of relationships between gene amplification and expression 
of 6095 genes in 37 intermediate grade human breast tumours. 
This study demonstrated that 62% of the highly amplified genes 
also showed elevated expression; overall, a two-fold change in 
DNA copy number was associated with a i. 5-fold change in mRNA 
levels. Overall, 12% of the variation in gene expression in the 
breast tumours studied was associated with gene copy number 
variation (Pollack et ah 2002). Further study of additional human 
breast tumours, at precisely defined grades and stages, will be 
necessary in order to more fully define the relationships between 
DNA copy numbers and expression of genes. The studies we report 
here indicate higher levels of c-Myc gene amplification and 
expression, than other previous reports in breast cancer. We 
believe that this is probably the result of our analysis of individual 
tumour cells in a well-defined set of high-grade breast tumours. 
Prior c-Myc expression and amplification microarray studies used 
tumour specimens which contain normal stromal components, 



Table 3 Correlations between c-myc gene copy number (RSH) mRNA 
expression (ISH) 



FISH 



potentially underestimating amplification and expression levels of 
the invasive tumour components (Pollack et ah 2002). 

Our study reports a percentage of tumours gene amplified for c- 
myc (using FISH in high-grade tumours) that is much higher than 
the average figure (15.5%) reported in the literature (lsola et ah 
2002). Most of the prior studies have employed the relatively 
insensitive Southern blot technique, and were reviewed in a 
recent meta-analysis (Deming et ah 2000). Consistent with this 
prior literature background, a recent study of 94 lobular and 
ductal breast cancers assessed amplification of c-myc by 
using a semiquantitative PCR assay and protein expression, with 



Table 5 Nuclear/cytoplasmic localisation of c-Myc comparing normal 
and invasive ceils 



Normal cells 
(frequency percent) 


Invasive cells 
(frequency percent) 




Total 


(A) Nuclear localisation 




+ 






28 




32 


+ 


14 


8 


22 


Total 


42 


12 


54 


(B) Cytoplasmic localisation 










12 


13 


25 


+ 


9 


20 


21 


Total 


21 


33 


S4 



In all. 54 pairs {normal vs invasive) of tissues were analysed to answer the questions of 
(I) whether positivity of nuclear ceHs in normal tissues is different from that in 
invasive cells, and simBariy (2) whether positivity of cytoplasmic cells in normal tissues 
is different from that in invasive cells. The data are summarised in the above 
contingency tables. In all, 72 normal cell specimens were positive for c-Myc staining 
(40.71%). compared to 12 specimens (22.2%) in invasive cells. The difference is 
statistically significant (P = 0.0l) by McNemar's x 3 test (two-sided). 



Low 



High 



(A) ISH {% cells) 

Low 

High 



(B) ISH (intensity) 

Low 

High 



f 

\9 



2 

18 



3 
18 

P = 0.0067 



5 
/6 

P = 0.0006 



Serial sections of high-grade human breast carcinomas were scored for c-myc gene 
copy number (RSH, Table 1) and mRNA expression (ISH, Table 2). In (A), a positive 
correlation (P- 0.0067) was observed between tumour samples with a high 
percentage of cells demonstrating mRNA expression and a high c-myc gene copy 
number. A score of 2 or higher was classified as high on ISH, and a score of median or 
greater was categorised as high on FISH. In (B). a positive correlation (P = 0.0006) 
was shown between a high level of intensity for c-Myc RNA expression a high and c- 
myc gene copy number. Note that a pairwise comparison of RSH and ISH was not 
possible for all cases, due to incomplete overlap of cases analysed with each assay. 



Table 6 Correlation between c-Myc protein expression (HC) and c- 
myc gene copy number (RSH) 



1HC (% celts) 




FISH 




Low 




High 


Low 


3 




0 


High 


10 




15 








P = 0.00 16 



Consecutive serial sections of high-grade human breast tumours were scored for c- 
myc gene copy number or protein expression, by immunohistochemistry (IHC). 1HC 
scores were defined in the Materials and methods section. Data were analysed for 
correlations between the results. A highly significant correlation was observed 
between high c-Myc protein expression (IHC) between percent cells positive and 
high c-myc gene amplification (FISH). P = 0.00 1 6 from two-sided McNemar's test. 
Note that for IS cases, no staining for c-Myc could be detected; these negative cases 
were not included in the correlation presented above. 



Table 4 c-Myc immunohistochemistry (IHC) results 



Staining intensity 


0 


1 


2 


3 


Percent positivity 


\ 


2 


3 


4 


Number of tumour samples in each category 


15 


13 


20 


3 


Number of tumour samples in each category 


2 


2 


1 


29 



In all. 5 1 high-grade human breast carcinomas were analysed to determine the relationships between c-Myc protein expression and c-myc gene in situ hybndisation results. Data 
are shown in two ways in the above table First, overall staining intensity of c-Myc-positive celts was scored as 0, 1 . 2. 3 (low to high), and the number of tumour samples at each 
level of staining is indicated on the line below. Next, in a random subset of these cases, the percentage of tumour cells staining was scored as 0. I. 2, 3. 4 (low to high %, as 
discussed in Matenals and methods). The number of tumours at each level of percent cell positiviiy for c-Myc is indicated on the line below. 
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densitometry, after Western blot. These data showed c-myc gene 
amplification in 21% of tumours (Jenkins et a/, 1997), using assays 
not based on in situ discrimination of tumour vs nontumour cells. 
The lower frequency of c-myc in this prior study is in contrast with 
the data we present here, and could be the result of the higher 
sensitivity and precision of the FISH and immunohistochemical 
methods, as distinct from quantitative PCR and Western blot 
densitometry. In addition, the 70% of amplified tumours in our 
study is also much higher than the 12% reported by Schraml et al 
(1999), using a c-myc FISH test on a tissue rnicroarray. This large 
difference may be because the arrays are prepared from cores of 
paraffin- embedded tissue, as small as 0.6 mm in diameter which 
may contain too few tumour cells for complete analysis of 
amplification of a gene, such as c-myc. c-myc is known to be 
quite heterogeneous in its gene amplification within individual 
tumours (in contrast to HER2/neu t for example) (Persons et al> 
1997). 

Most previous reports on the expression of c-myc mRNA have 
utilised Northern blot, dot blot or PCR-based approaches, while 
just a few involved in situ hybridisation, which were primarily 
performed on frozen tissue sections (Liao and Dickson, 2000). 
Normal breast tissue is dominated by adipose cells, differing 
greatly from tumour tissue in its epithelial cellularity. Thus, 
normal and tumour tissues may not be rigorously compared by 
techniques involving RNA extraction from total tissue. Therefore, 
conclusions such as 'increased expression 1 may be more difficult to 
make from studies with Northern blot, dot blot and PCR-based 
techniques that require RNA extraction from tissues that have not 
been fastidiously micro-dissected for selection of tumour cells. 
Using a more sensitive, nonradioactive in situ hybridisation (ISH) 
approach on formalin-fixed, paraffin-embedded sections, we 
report herein high expression of c-myc mRNA in 92% of high- 
grade breast carcinomas. This figure is much higher than the 
recently reported data (22%), obtained by using a real-time RT- 
PCR method (Bieche et at> 1999). Dilution of the RNA from 
epithelium by the RNA from adipose in normal breast tissue in this 
latest prior report may be one of the possible explanations for this 
large difference. 

In conclusion, the present study shows that approximately 70, 92 
and 70% of biopsies of untreated high-grade breast cancer exhibit 
c-myc gene amplification, mRNA overexpression and protein 
overexpression, respectively. In most cases (84%), with gene copy 



amplification, the c-myc gene gains one to two additional copies, c- 
myc gene amplification was significantly associated with expres- 
sion of its mRNA (both by intensity in invasive cells and by 
percentage positivity in invasive cells), and with expression of its 
protein (by percentage positivity in invasive cells). However, our 
data were also consistent with the prior literature on c-Myc 
(reviewed in Nass and Dickson, 1997; Liao and Dickson, 2000), 
indicating complex transcriptional, post transcriptional, transla- 
tional and post-translational control of c-Myc expression in vitro. 
Specifically, in Table 5 we observed that in 40% of the high-grade 
tumours tested, c-Myc protein was expressed at high levels, despite 
a lack of its gene amplification. 

It will be interesting to analyse lower grade tumours and 
premalignant lesions, with the same measurement tools, to 
determine if this c-myc amplification pattern is different, 
comparing different steps in onset and progression of the 
disease. Specifically, prior studies in fibroblasts and in human 
mammary epithelial cells (Liao et al, 1998, 2000a, b) have 
demonstrated that only a subtle deregulation of expression of c- 
Myc is sufficient to allow genomic instability. These prior cell 
biologic findings raise the question of whether c-Myc protein 
expression precedes or follows its gene amplification during the 
course of die natural history of breast cancer. It will also be 
interesting for future studies of lower grade breast cancers and 
premalignant lesions to determine whether there is evidence of 
nuclear exclusion of c-Myc protein. Indeed, nuclear exclusion 
of c-Myc in high-grade tumours could serve to attenuate its 
functions in later stages of disease progression (Liao and Dickson, 
2000). 
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primers and analyzing expression of the DNA sequences of interest in normal and tumor tissues. 
The analyses were designed to determine whether a difference exists between gene expression in 
normal tissues as compared to tumor in the same tissue type. 
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Was responsible for numerous cloning projects including: studies of somatic hypermutation, 
studies of AIDS-associated lymphomas, and cloning of t(5; 14), t(l 1 ; 14), and t(8; 14) 
translocations. Focused on the activation of hemopoietic growth factors involved in the t(5;14) 
translocation in leukemia patients.. 
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Worked on a subunit porcine vaccine directed against Mycoplasma hyopneumoniae. Was 
responsible for generating genomic libraries, screening with degenerate oligonucleotides, and 
characterizing and expressing clones in E. coli. Also constructed a general purpose expression 
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DECLARATION OF PAUL POLAKIS, PhD. 
L Paul Polakis, Ph.D., declare and say as follows: , 

1. I was awarded a PhD by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. S ince joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary foCus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins, that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins".. When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treaunent of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins ' 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 



expressed from that mRNA in that cell type. In approximately 80% of our . 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 

. opinion that for human genes, an increased level of mRNA in a tumor cell relative 
. to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of , 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardi2e the validity of the application or any patent issued 
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Staff Scientist 

Genentech, Ihc 

1 DNA Way, MS#40 

S. San Francisco, CA 94080 

EDUCATION: 

Ph.D., Biochemistry, Department of Biochemistry, 
Michigan State University (1984) 

B.S., Biology. College of Natural Science, Michigan State University (1977) 
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PROFESSIONAL EXPERIENCE: 

2002-present Staff Scientist, Genentech, Inc 

S. San Francisco, CA 



Senior Scientist, Genentech, Inc., 
S. San Francisco, CA 



Research Director 
Onyx Pharmaceuticals, Richmond, CA 



Senior Scientist, Project Leader, Onyx 
Pharmaceuticals, Richmond, CA 



Senior Scientist, Chiron Corporation, 
Emeryville, CA. 

Scientist, Cetus Corporation, Emeryville CA, 

Postdoctoral Research Associate, Genentech, 
Inc., South SanFrancisco, CA. 



1999- 2002 

1997 -1999 

H 

< • 

1992-1996 

1991-1992 

1989-1991 
1987-1989 



1985-1987 



Postdoctoral Research Associate, Department 
of Medicine, Duke University Medical Center, 
Durham, NC 



1984-1985 



Assistant Professor, Department of Chemistry, 
Oberiin College, Oberiin, Ohio 



1980-1984 Graduate Research Assistant, Department of 

Biochemistry, Michigan State University 
East Lansing, Michigan 
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Lfjacts. If these minor cell proteins differ among cells to the same extent as the 
jjjore abundant proteins, as is commonly assumed, only a small number of pro- 
tein differences (perhaps several hundred) suffice to create very large differences 
In cell morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
tbeir patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, the production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal the liver to increase the 
production of glucose from amino acids and other small molecules; the set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longerpr esent » the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of ceD specializations-different cell types often respond in dif- 
ferent ways to the same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect the persistent expression of different sets of 
genes. 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 4 

If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them can in principle be regulated. Thus a cell can control the proteins it 
makes by (1) controlling when and how often a given gene is transcribed (tran- 
scriptional control), (2) controlling how the primary RNA transcript is spliced or 
otherwise processed (RNA processing control), (3) selecting which completed 
niRNAs in the cell nucleus are exported to the cytoplasm (RNA transport con- 
trol), (4) selecting which mRNAs in the cytoplasm are translated by ribosomes 
(translational control), (5) selectively destabilizing certain mRNA molecules in 
the cytoplasm (mRNA degradation control), or (6) selectively activating, inacti- 
vating or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figure 9-2). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 9-2, only transcrip- 
tional control ensures that no superfluous intermediates are synthesized. In the 
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Figure 9-2 Six steps at which 
eucaryote gene expression can be 
controlled. Only controls that operate 
at steps 1 through 5 are discussed in 
this chapter. The regulation of protein 
activity (step 6) is discussed in 
Chapter 5; this includes reversible 
activation or inactivation by protein 
phosphorylation as well as 
irreversible inactivation by proteolytic 
degradation. 
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following sections we discuss the DNA and protein components that regulate the 
initiation of gene transcription. We return at the end of the chapter to the other 
ways of regulating gene expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction ofits genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals from 
other cells. Although all of the steps involved in expressing a gene can in principle be 
regulated, for most genes the initiation of RNA transcription is the most important 
point of control 



DNA-binding Motifs in Gene 
Regulatory Proteins 5 

How does a cell determine which ofits thousands of genes to transcribe? As dis- 
cussed in Chapter 8, the transcription of each gene is controlled by a regulatory 
region of DNA near the site where transcription begins. Some regulatory regions 
are simple and act as switches that are thrown by a single signal. Other regula- 
tory regions are complex and act as tiny microprocessors, responding to a vari- 
ety of signals that they interpret and integrate to switch the neighboring gene on 
or off. Whether complex or simple, these switching devices consist of two fun- 
damental types of components: (1) short stretches of DNA of defined sequence 
and (2) gene regulatory proteins that recognize and bind to them. 

We begin our discussion of gene regulatory proteins by describing how these 
proteins were discovered. 

Gene Regulatory Proteins Were Discovered Using 
Bacterial Genetics 6 

Genetic analyses in bacteria carried out in the 1950s provided the first evidence 
of the existence of gene regulatory proteins that turn specific sets of genes on 
or off. One of these regulators, the lambda repressor, is encoded by a bacterial 
virus, bacteriophage lambda. The repressor shuts off the viral genes mat code for 
the protein components of new virus particles and thereby enables the viral ge- 
nome to remain a silent passenger in the bacterial chromosome, multiplying with 
the bacterium when conditions are favorable for bacterial growth (see Figure 
6-80). The lambda repressor was among the first gene regulatory proteins to be 
characterized, and it remains one of the best understood, as we discuss later. 
Other bacterial regulators respond to nutritional conditions by shutting off genes 
encoding specific sets of metabolic enzymes when they are not needed. The lac 
repressor, for example, the first of these bacterial proteins to be recognized, turns 
off the production of the proteins responsible for lactose metabolism when this 
sugar is absent from the medium. 

The first step toward understanding gene regulation was the isolation of 
mutant strains of bacteria and bacteriophage lambda that were unable to shut 
off specific sets of genes. It was proposed at the time, and later proved, that most 
of these mutants were deficient in proteins acting as specific repressors for these 
sets of genes. Because these proteins, like most gene regulatory proteins, are 
present in small quantities, it was difficult and time-consuming to isolate them. 
They were eventually purified by fractionating cell extracts on a series of stan- 
dard chromatography columns (see pp. 166-169). Once isolated, the pro- 
teins were shown to bind to specific DNA sequences close to the genes that they 
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Figure 9-3 Double-helical structure 
of DNA. The major and minor grooves 
on the outside of the double helix are 
indicated. The atoms are colored as 
follows: carbon, dark blue; nitrogen* 
light blue; hydrogen, white; oxygen ; 
red; phosphorus, yellow. 
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Figure 9-71 A mechanism to explain 
both the marked deficiency of CG 
sequences and the presence of CG 
islands in vertebrate genomes. A 

black line marks the location of an 
unmethylated CG dinucleotide in the 
DNA sequence, while a red line marks 
the location of a methylated CG 
dinucleotide. 
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Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many spe- 
cialized animal cells can maintain their unique character when grown in culture, the 
gene regulatory mechanisms involved in creating them must be stable once estab- 
lished and heritable when the cell divides, endowing the cell with a memory of its 
developmental history. Procaryotes and yeasts provide unusually accessible model 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
evant to the creation of specialized celt types in higher eucaryotes. One such mecha- 
nism involves a competitive interaction between two (or more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other, this can create a flip-flop 
switch that switches a cell between two alternative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory. 

In eucaryotes gene transcription is generally controlled by combinations of gene 
regulatory proteins. It is thought that each type of cell in a higher eucaryotic organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of cell A given gene regulatory pro- 
tein may be expressed in a variety of circumstances and typically is involved in the 
regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utilized by eucaryotic cells to regulate gene expression. In ver- 
tebrates DNA methylation also plays a part, mainly as a device to reinforce decisions 
about gene expression that are made initially by other mechanisms. 



Posttranscriptional Controls 

Although controls on the initiation of gene transcription are the predominant 
form of regulation for most genes, other controls can act later in the pathway 
from RNA to protein to modulate the amount of gene product that is made. Al- 
though these posttranscriptional controls, which operate after RNA polymerase 
has bound to the gene's promoter and begun RNA synthesis, are less common 
toan transcriptional control fox many genes they are crucial. It seems that every 
step in gene expression that could.be controlled in principle is likely to be regu- 

a ted under some circumstances for some genes. 

We consider the varieties of posttranscriptional regulation in temporal or- 

er ! acc °rding to the sequence of events that might be experienced by an RNA 
Molecule after its transcription has begun (Figure 9-72). 
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Figure 6-3 Genes can be expressed 
with different efficiencies* Gene A Is 
transcribed and translated much more 
efficiently than gene B.This allows the 
amount of protein A in the cell to be 
much greater than that of protein B. 
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FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency, allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 



Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence — a gene — into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by phosphodiester bonds (Figure 6-^). It differs from 
DNA chemically in twcT respects: (1) the nucleotides in RNA are 
ribonucleotides— that is, they contain the sugar ribose (hence the name ribonu- 
cleic acid) rather than deojcyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine (G), and cytosine (C), it contains the base uracil (U) 
instead of the thymine (T) in DNA. Since U, like T, can base-pair by hydrogen- 
bonding with A (Figure 6-5), the complementary base-pairing properties 
described for DNA in Chapters 4 and 5 apply also to RNA (in RNA, G pairs with 
C, and A pairs with U). It is not uncommon, however, to find other types of base 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall structure. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single -stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As we see later in this chapter, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have structural 
and catalytic functions. 



Transcription Produces RNA Complementary to 
One Strand of DNA 

All of the RNA in a cell is made by DNA transcription, a process that has cer- 
tain similarities to the process of DNA replication discussed in Chapter 5. 
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Figure 6-89 Protein aggregates that cause human disease. (A) Schematic illustration of the type of 
conformational change in a protein that produces material for a cross-beta filament (B) Diagram illustrating 
the self-infectious nature of the protein aggregation that Is central to prion diseases. PrP is highly unusual 
because the misfolded version of the protein, called PrP*, induces the normal PrP protein it contacts* to 
change its conformation, as shown. Most of the human diseases caused by protein aggregation are caused by 
the overproduction of a variant protein that is especially prone to aggregation, but because this structure Is 
not infectious in this way, it cannot spread from one animal to another. (C) Drawing of a cross-beta filament, 
a common type of protease-resistant protein aggregate found in a variety of human neurological diseases. 
Because the hydrogen-bond interactions in a p sheet form between polypeptide backbone atoms (see Figure 
3-9), a number of different abnormally folded proteins can produce this structure. (D) One of several 
possible models for the conversion of PrP to PrP*, showing the likely change of two ot-helices into four 
p-strands. Although the structure of the normal protein has been determined accurately, the structure of the 
infectious form Is not yet known with certainty because the aggregation has prevented the use of standard 
structural techniques. (C, courtesy of Louise Serpell, adapted from M. Sunde et al.,J. Mol Biol 273:729-739, 
1 997; D, adapted from S.B. Prusiner, Trends Brodiem. Sd. 21 :482-487, 1 996.) 

animals and humans. It can be dangerous to eat the tissues of animals that con- 
tain PrP*, as witnessed most recently by the spread of BSE (commonly referred 
to as the "mad cow disease") from catde to humans in Great Britain. 

Fortunately, in the absence of PrP*, PrP is extraordinarily difficult to convert 
to its abnormal form. Although very few proteins have the potential to misfold 
into an infectious conformation, a similar transformation has been discovered 
to be the cause of an otherwise mysterious "protein-only inheritance" observed 
in yeast cells. 

There Are Many Steps From DNA to Protein 

We have seen so far in this chapter that many different types of chemical reac- 
tions are required to produce a properly folded protein from the information 
contained in a gene (Figure 6-90). The final level of a properly folded protein in 
a cell therefore depends upon the efficiency with which each of the many steps 
is performed. 

We discuss in Chapter 7 that cells have the ability to change the levels of 
their proteins according to their needs. In principle, any or all of the steps in Fig- 
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Figure 6-90 The production of a 
protein by a eucaryotk cell. The final 
level of each protein in a eucaryotic cell 
depends upon the efficiency of each step 
depicted. 
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ure 6-90) could be regulated by the cell for each individual protein. However, as 
we shall see in Chapter 7, the initiation of transcription is the most common 
point for a cell to regulate the expression of each of its genes. This makes sense, 
inasmuch as the most efficient way to keep a gene from being expressed is to 
block the very first step— the transcription of its DNA sequence into an RNA 
molecule. 



Summary 

The translation of the nucleotide sequence of an mRNA molecule into protein takes 
place in the cytoplasm on a large ribonucleoprotein assembly called a ribosome. The 
amino acids used for protein synthesis are first attached to a family of tRNA 
molecules, each of which recognizes, by complementary base-pair interactions, par- 
ticular sets of three nucleotides in the mRNA (codons). The sequence of nucleotides in 
the mRNA is then read from one end to the other in sets of three according to the 
genetic code. 

To initiate translation, a small ribosomal subunit binds to the mRNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal subunit binds to complete the ribosome and begin the elongation 
phase of protein synthesis. During this phase, aminoacyl tRNAs—each bearing a 
specific amino acid bind sequentially to the appropriate codon in mRNA by forming 
complementary base pairs with the tRNA anticodon. Each amino acid is added to the 
C-terminal end of the growing polypeptide by means of a cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled. Controls that operate at 
steps f through 5 are discussed ip this 
chapter. Step 6, the regulation of protein 
activity, includes reversible activation or 
inactlvation by protein phosphorylation 
(discussed in Chapter 3) as well as 
irreversible inactlvation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various cell types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
a cell can control the proteins it makes by (1) controlling when and how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and detennining where in the cytosol they are localized (RNA transport and 
localization control), (4) selecting which mRNAs in the cytoplasm are translated 
byribosomes (translational control), (5) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (6) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific protein 
molecules after they have been made (protein activity control) (Figure 7-5). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize superfluous interme- 
diates. In the following sections we discuss the DNA and protein components 
that perform this function by regulating the initiation of gene transcription. We 
shall return at the end of the chapter to the additional ways of regulating gene 
expression. 

Summary 

The genome of a cell contains in its DNA sequence the information to make many 
thousands of different protein and RNA molecules. A cell typically expresses only a 
fraction of its genes, and the different types of cells in multicellular organisms arise 
because different sets of genes are expressed. Moreover, cells can change the pattern 
of genes they express in response to changes in their environment, such as signals 
from other cells. Although all of the steps involved in expressing a gene can in prin- 
ciple be regulated, far most genes the initiation of RNA transcription is the most 
important point of control 



DNA-BINDING MOTIFS IN GENE REGULATORY 
PROTEINS 

How does a cell determine which of its thousands of genes to transcribe? As 
mentioned briefly in Chapters 4 and 6, the transcription of each gene is con- 
tolled by a regulatory region of DNA relatively near the site where transcription 
begins. Some regulatory regions are simple and act as switches that are thrown 
; by a single signal. Many others are complex and act as tiny microprocessors, 
tesjponding to a variety of signals that they interpret and integrate to switch the 
"^ghboring gene on or off. Whether complex or simple, these switching devices 
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occur in the germ line, the cell lineage that gives rise to sperm or eggs. Most of 
the DNA in vertebrate germ cells is inactive and highly methylated. Over long 
periods of evolutionary time, the methylated CG sequences in these inactive 
regions have presumably been lost through spontaneous deamination events 
that were not properly repaired. However promoters of genes that remain active 
in the germ cell lineages (including most housekeeping genes) are kept 
unmethylated, and therefore spontaneous deaminations of Cs that occur with- 
in them can be accurately repaired. Such regions are preserved in modern day 
vertebrate cells as CG islands. In addition, any mutation of a CG sequence in the 
genome that destroyed the function or regulation of a gene in the adult would be 
selected against, and some CG islands are simply the result of a higher than nor- 
mal density of critical CG sequences. 

The mammalian genome contains an estimated 20,000 CG islands. Most of 
the islands mark the 5' ends of transcription units and thus, presumably, of 
genes. The presence of CG islands often provides a convenient way of identify- 
ing genes in the DNA sequences of vertebrate genomes. 



Summary 

The many types of cells in animals and plants are created largely through mecha- 
nisms that cause different genes to he transcribed in different cells. Since many 
specialized animal cells can maintain their unique character through many cell 
division cycles and even when grown in culture, the gene regulatory mechanisms 
involved in creating them must be stable once established and heritable when the 
cell divides. These features endow the cell with a memory of its developmental history. 
Bacteria and yeasts provide unusually accessible model systems in which to study 
gene regulatory mechanisms. One such mechanism involves a competitive interac- 
tion between two gene regulatory proteins, each of which inhibits the synthesis of the 
other; this can create a flip-flop switch that switches a cell between two alternative 
patterns of gene expression. Direct or indirect positive feedback loops, which enable 
gene regulatory proteins to perpetuate their own synthesis, provide a general mech- 
anism for cell memory. Negative feedback loops with programmed delays form the 
basis for cellular clocks. 

In eucaryotes the transcription of a gene is generally controlled by combinations 
of gene regulatory proteins. It is thought that each type of cell in a higher eucaryotic 
organism contains a specific combination of gene regulatory proteins that ensures 
the expression of only those genes appropriate to that type of cell A given gene regu- 
latory protein may be active in a variety of circumstances and typically is involved 
in the regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also used by eucaryotic cells to regulate gene expression. An espe- 
cially dramatic case is the inactivation of an entire X chromosome in female mam- 
mals. In vertebrates DNA methylation also functions in gene regulation, being used 
mainly as a device to reinforce decisions about gene expression that are made ini- 
tially by other mechanisms. DNA methylation also underlies the phenomenon of 
genomic imprinting in mammals, in which the expression of a gene depends on 
whether it was inherited from the mother or the father. 
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Figure 7-86 A mechanism to explain 
both the marked overall deficiency 
of CG sequences and their clustering 
into CG islands in vertebrate 
genomes. A black line marks the location 
of a CG dinudeotide in the DNA 
sequence, while a red "lollipop" indicates 
the presence of a methyl group on the 
CG dinudeotide. CG sequences that He in 
regulatory sequences of genes that are 
transcribed in germ cells are unmethylated 
and therefore tend to be retained in 
evolution. Methylated CG sequences, on 
the ofher hand, tend to be lost through 
deamination of 5-methyl C toT, unless the 
CG sequence is critical for survival. 



POSTTRANSCRIPTIONAL CONTROLS 

In principle, every step required for the process of gene expression could be 
controlled. Indeed, one can find examples of each type of regulation, although 
any one gene is likely to use only a few of them. Controls on the initiation of 
gene transcription are the predominant form of regulation for most genes. But 
other controls can act later in the pathway from DNA to protein to modulate 
the amount of gene product that is made. Although these posttranscriptional 
controls, which operate after RNA polymerase has bound to the gene's promoter 
and begun RNA synthesis, are less common than transcriptional control, for 
many genes they are crucial. 
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CHAPTER 29 

Regulation of transcription 



Xlte phenotypic differences thai distinguish the 
.various kinds of cells in a higher eukaryole are 
largely due to differences in the expression of 
?e nes that code Tor proteins, that is. those tran- 
scribed by RNA polymerase II. In principle, the 
expression of these genes might be regulated at 
aJ) v one ol' several stages. The concepi of the 
-level or control** implies that gene expression 
" i S not necessarily an automatic process once it 
has begun. It could be regulated in a gene- 
specific way ai any one or several sequential 
steps. We can distinguish (at least) live poten- 
tial control points, running the series: 

Activation of gene structure 
«i 

Initiation of transcription 
I 

Processing the transcript 
i 

Transport, to cytoplasm 
I 

Translation or mRNA 

The existence of the first step is implied by 
the discovery that genes may exist in either of 
two structural conditions, Relative to the state 
nf most of the genome, genes are found in 
;in "active*' stale In the cells in which they 
one expressed (see Chapter 27). The change of 
structure is. distinct from the act or transcrip- 
tion, and indicates that the gene is *transerib- 
able." This suggests that acquisition of the 
"active 1 ' structure must be the first step in gene 
expression. 

Transcription of a gene in the active stale is 



controlled at the stage of initiation, that is. by 
the interaction or RNA polymerase with its pro- 
moter. This is now becoming susceptible to 
analysis In the in viiro systems (see Chapter 
28). For most genes, this is a major control 
point; probably it is the most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eukary- 
otic cells, for example, via antitermination 
mechanisms. 

The primary transcript is modified by capping 
at the 5' end, and usually also by polvadenyla- 
tion at the 3' end. Introns must be spliced out 
from Ihe transcripts of interrupted genes. The 
mature RNA must be exported from, the nucleus 
to the cytoplasm. Regulation of gene expression 
by selection of sequences at the level of nuclear 
RNA might involve any or all of these stages, 
but the one for which we have most evidence 
concerns changes in splicing: some genes are 
expressed by means of alternative splicing pat- 
terns whose regulation controls the type or pro- 
tein product (see Chapter 30). 

Finally. Ihe translation of an mRNA in the cyto- 
plasm can be specifically controlled. There is little 
evidence for the employment of this mechanism in 
adult somatic cells, but it does occur in some 
embryonic situations, as described in Chapter 7. 
The mechanism is presumed lo involve the block- 
ing of initiation of translation of some mRNAs by 
specific protein factors. 

But having acknowledged that control or gene 
expression can occur at multiple stages, and 
that production of RNA cannot inevitably be 
equated with production of protein, it is clear 
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o 



that the ovenvhelming majority or regulatory 
events occur at^the^ initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at the heart of eukaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves lo provide 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulation: what identifies 
the common target genes to the transcription 
factor; and how is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



Response elements identify genes under common 
regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they' share a ptvmoter element that is recognized 
by a regulatory transcription factor. An element 
that causes a gene to respond to such a factor 
is called a response element; examples are the 
HSE (heat shock response element), GRE 
(glucocorticoid response element), SRE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that they recognize are 
summarized in- Table 29.1. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
extends for a short distance on either side of 



Table 29.1 Inducible transcription factors bind to 
response elements that identify groups ol promoters 
or enhancers subject to coordinate control. 



Regulatory Agent Module Consensus 



Factor 



Heat shock HSE CNNGAANNTCCNNG HSTF 

Glucocorticoid GRE TGGTACAAATGTTCT Receptor 

Phorbol ester TRE TGACTCA API 

Serum SRE CCATATTAGG SRF 



the consensus sequence. In promoters, the ele- 
ments are not present at fixed distances frt>ni 
the startpoint, but are usualiy <200 bp upstream 
of it. The presence of a single element usually 
is sufficient to confer the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in P* 0 " 
•moters or in enhancers. Some types of element 
are typically found in one rather than the oti* p 
usually an HSE is found in a promoter, white ' 
GRE is found in an enhancer. We assume tM 
all response elements function by the san> e 
general principle. A gene is regulated by 0 
sequence at the promoter or enhancer thai *f 
recognized by a specific protein. The P^*!* 
Junctions as a transcription factor needed J* 
RKA polytnerase to initiate. Active protein ^ 
available only under conditions when the, gen? 
to be expressed; its absence means that the f* 1 
moter is not activated by this particular ****** }y 

An example of a situation in which 
genes are controlled by a single factor is r\ 
vided by the heat shock response. This is *^ 
mon to a wide range of prokary° tcS t1 f 
eukaryotes and involves multiple contro is 
gene expression: an increase in tetnp era ^ 
turns off transcription of some genes, lurnS a ,|d 
transcription of the heat shock S eneS, Rj ^v- 
causes changes in the translation of m Jf > 
The control of the heat shock genes i J j uSl 
the differences between prokaryotic ^ 
eukaryotic modes of control In. bacteria, a ^4 
sigma factor is synthesized that directs ^ |rr - 
polymerase holoenzyme to recognize an - 
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Background: Prostate stem ceil antigen (PSCA) is a recently defined homologue of the Thy-l/Ly-6 family of 
giycosylphosphatidylinositol (GPI)-anchored cell surface antigens. The purpose of the present study was to 
examine the expression status of PSCA protein and mRNA in clinical specimens of human prostate cancer (Pea) 
and to validate it as a potential molecular target for diagnosis and treatment of Pea. 

Materials and Methods: Immunohistochemical (IHC) and in situ hybridization (ISH) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections from 20 benign prostatic hyperplasia 
(BPH), 20 prostatic intraepithelial neoplasm (PIN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
independent prostate cancers. The level of PSCA expression was semiquantitative!/ scored by assessing both the 
percentage and intensity of PSCA-positive staining cells in the specimens. Then compared PSCA expression 
between BPH, PIN and Pea tissues and analysed the correlations of PSCA expression level with pathological grade, 
clinical stage and progression to androgen-independence in Pea. 

Results: In BPH and low grade PIN, PSCA protein and mRNA staining were weak or negative and less intense 
and uniform than that seen in HGPIN and Pea. There were moderate to strong PSCA protein and mRNA 
expression in 8 of 1 1 (727%) HGPIN and in 40 of 48 (83.4%) Pea specimens examined by IHC and ISH analyses, 
with statistical significance compared with BPH (20%) and low grade PIN (22.2%) samples (p < 0.05, respectively). 
The expression level of PSCA increased with high Gleason grade, advanced stage and progression to androgen- 
independence (p < 0.05. respectively). In addition, IHC and ISH staining showed a high degree of correlation 
between PSCA protein and mRNA overexpression. 

Conclusions: Our data demonstrate that PSCA as a new cell surface marker is overexpressed by a majority of 
human Pea. PSCA expression correlates positively with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and androgen-independence, and 
speculatively with prostate carcinogenesis. PSCA protein overexpression results from upregulated transcription 
of PSCA mRNA. PSCA may have prognostic utility and may be a promising molecular target for diagnosis and 
treatment of Pea. 
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Introduction 

Prostate cancer (Pea) is the second leading cause of can- 
cer-related death in American men and is becoming a 
common cancer increasing in China, Despite recently 
great progress in the diagnosis and management of local- 
ized disease, there continues to be a need for new diagnos- 
tic markers that can accurately discriminate between 
indolent and aggressive variants of Pea. There also contin- 
ues to be a need for the identification and characterization 
of potential new therapeutic targets on Pea cells. Current 
diagnostic and therapeutic modalities for recurrent and 
metastatic Pea have been limited by a lack of specific tar- 
get antigens of Pea. 

Although a number of prostate-specific genes have been 
identified (i.e. prostate specific antigen, prostatic acid 
phosphatase, glandular kallikrein 2), the majority of these 
are secreted proteins not ideally suited for many immuno- 
logical strategies. So, the identification of new cell surface 
antigens is critical to the development of new diagnostic 
and therapeutic approaches to the management of Pea. 

Reiter RE et al [1] reported the identification of prostate 
stem cell antigen (PSCA), a cell surface antigen that is pre- 
dominantly prostate specific. The PSCA gene encodes a 
123 amino acid glycoprotein, with 30% homology to 
stem cell antigen 2 (Sea 2). Like Sca-2, PSCA also belongs 
to a member of the Thy-l/Ly-6 family and is anchored by 
a glycosylphosphatidylinositol (GPI) linkage. mRNA in 
situ hybridization (ISH) localized PSCA expression in nor- 
mal prostate to the basal cell epithelium, the putative 
stem cell compartment of prostatic epithelium, suggesting 
that PSCA may be a marker of prostate stem/progenitor 
cells. 

In order to examine the status of PSCA protein and mRNA 
expression in human Pea and validate it as a potential 
diagnostic and therapeutic target for Pea, we used immu- 
nohistochemistry (IHC) and in situ hybridization (ISH) 
simultaneously, and conducted PSCA protein and mRNA 
expression analyses in paraffin-embedded tissue speci- 
mens of benign prostatic hyperplasia (BPH, n ■ 20), pros- 
tate intraepithelial neoplasm (PIN, n = 20) and prostate 
cancer (Pea, n = 48). Furthermore, we evaluated the possi- 
ble correlation of PSCA expression level with Pea tumori- 
genesis, grade, stage and progression to androgen- 
independence. 

Materials and methods 
Tissue samples 

All of the clinical tissue specimens studied herein were 
obtained from 80 patients of 57-84 years old by prostate- 
ctomy, transurethral resection of prostate (TURP) or biop- 
sies. The patients were classified as 20 cases of BPH, 20 
cases of PIN, 40 cases of primary Pea, including 9 patients 



with recurrent Pea and a history of androgen ablation 
therapy (orchiectomy and/or hormonal therapy), who 
were referred to as androgen-independent prostate can- 
cers. Eight specimens were harvested from these andro- 
gen-independent Pea patients prior to androgen ablation 
treatment. Each tissue sample was cut into two parts, one 
was fixed in 10% formalin for IHC and the other treated 
with 4% paraformaldehyde/0. 1 M PBS PH 7.4 in 0.1% 
DEPC for 1 h for ISH analysis, and then embedded in par- 
affin. All paraffin blocks examined were then cut into 5 
um sections and mounted on the glass slides specific for 
IHC and ISH respectively in the usual fashion. H&E- 
stained section of each Pea was evaluated and assigned a 
Gleason score by the experienced urological pathologist at 
our institution based on the criteria of Gleason score (2j. 
The Gleason sums are summarized in Table 1. Clinical 
staging was performed according to Jewett-whitmore- 
prout staging system, as shown in Table 2. In the category 
of PIN, we graded the specimens into two groups, i.e. low 
grade PIN (grade I - II) and high grade PIN (HGPIN, 
grade III) on the basis of literatures |3,4j. 

Imrrtunohistochemlcat (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, 
and subjected to microwaving in 10 mmol/L citrate 
buffer, PH 6.0 (Boshide, Wuhan, China) in a 900 W oven 
for 5 min to induce epitope retrieval Slides were allowed 
to cool at room temperature for 30 min. A primary mouse 
antibody specific to human PSCA (Boshide, Wuhan, 
China) with a 1:100 dilution was applied to incubate with 
the slides at room temperature for 2 h. Labeling was 
detected by sequentially adding biotinylated secondary 
antibodies and strepavidin-peroxidase, and localized 
using 3,3'-diaminobenzidine reaction. Sections were then 
counterstained with hematoxylin. Substitution of the pri- 
mary antibody with phosphate-buffered-saline (PBS) 
served as a negative-staining control. 

mRNA in situ hybridization (ISH) 

Five-um-thick tissue sections were deparaffinized and 
dehydrated, then digested in pepsin solution (4 mg/ml in 
3% citric acid) for 20 min at 37.5 °C, and further proc- 
essed for ISH. Digoxigenin-labeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, 
Wuhan, China) were hybridized to the sections at 48 <I C 
overnight. The posthybridization wash with a high strin- 
gency was performed sequentially at 37° C in 2 x standard 
saline citrate (SSC) for 10 min, in 0.5 * SSC for 15 min 
and in 0.2 * SSC for 30 min. The slides were then incu- 
bated to biotinylated mouse anti-digoxigenin antibody at 
37.5°Cforl h followed by washing in 1 * PBS for 20 min 
at room temperature, and then to strepavidin-peroxidase 
at 37.5°C for 20 min followed by washing in 1 x PBS for 
15 min at room temperature. Subsequently, the slides 
were developed with diaminobenzidine and then coun- 
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Table I : Correlation of PSCA expression with Gleason score 



Intensity * frequency 



Gleason score 



0-6 « 



9(%) 



2-4 

5- 7 

6- I0 



5(83) 
19(79) 
• 5(28) 



1(17) 
5(21) 
13(72) 



Table 2: Correlation of PSCA expression with clinical stage 



Intensity * frequency 



Tumor stage 



0-6 TO 



9(%) 



27 (67.5) 
2(25) 



13(32.5) 
6(75) 



terstained with hematoxylin to localize the hybridization 
signals. Sections hybridized with the sense control probes 
routinely did not show any specific hybridization signal 
above background. All slides were hybridized with PBS to 
substitute for the probes as a negative control. 

Scoring methods 

To determine the correlation between the results of PSCA 
immunostaining and mRNA in situ hybridization, the 
same scoring manners are taken in the present study for 
PSCA protein staining by IHC and PSCA mRNA staining 
by ISH- Each slide was read and scored by two independ- 
endy experienced urological pathologists using Olympus 
BX-41 light microscopes. The evaluation was done in a 
blinded fashion. For each section, five areas of similar 
grade were analyzed semiquantitatively for the fraction of 
cells staining. Fifty percent of specimens were randomly 
chosen and rescored to determine the degree of interob- 
server and intraobserver concordance. There was greater 
than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopi- 
cally was graded on a scale of 0 to 3+ with 3 being the 
highest expression observed (0, no staining; 1+, mildly 
intense; 2+, moderately intense; 3+, severely intense). The 
staining density was quantified as the percentage of cells 
staining positive for PSCA with the primary antibody or 
hybridization probe, as follows: 0 = no staining; 1 = posi- 
tive staining in <25% of the sample; 2 * positive staining 
in 25%-50% of the sample; 3 = positive staining in >50% 



of the sample. Intensity score (0 to 3+) was multiplied by 
the density score (0-3) to give an overall score of 0-9 
[1,5]. In this way, we were able to differentiate specimens 
that may have had focal areas of increased staining from 
those that had diffuse areas of increased staining [6]. The 
overall score for each specimen was then categorically 
assigned to one of the following groups: 0 score, negative 
expression; 1-2 scores, weak expression; 3-6 scores, mod- 
erate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA protein and mRNA expres- 
sion in BPH, PIN and Pea tissues were compared using the 
Chi-square and Student's t-test. Univariate associations 
between PSCA expression and Gleason score, clinical 
stage and progression to androgen- independence were 
calculated using Fisher's Exact Test. For all analyses, p < 
0.05 was considered statistically significant. 

Results 

PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed 
weakly in individual samples of BPH. Some areas of 
prostate expressed weak levels (composite score 1-2), 
whereas other areas were completely negative {composite 
score 0). Four cases (20%) of BPH had moderate expres- 
sion of PSCA protein and mRNA (composite score 4-6) 
by IHC and ISH. In 2/20 (10%) BPH specimens, PSCA 
mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 
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2) in one and negative (composite score 0) in the other. 
PSCA expression was localized to the basal and secretory 
epithelial cells, and prostatic stroma was almost negative 
staining for PSCA protein and mRNA in all cases 
examined. 

PSCA expression In PIN 

In this study, we detected weak or negative expression of 
PSCA protein and mRNA (£2 scores) in 7 of 9 (77.8%) 
low grade PIN and in 2 of 11 (18.2%) HGPIN, and mod- 
erate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 1 1 (45,5%) HGPIN. One HGPIN with moderate 
PSCA mRNA expression (6 score) was found weak stain- 
ing for PSCA protein (2 score) by IHC. Strong PSCA pro- 
tein and mRNA expression (9 score) were detected in the 
remaining 3 of 11 (27.3%) HGPIN. There was a statisti- 
cally significant difference of PSCA protein and mRNA 
expression levels observed between HGPIN and BPH (p < 
0.05), but no statistical difference reached between low 
grade PIN and BPH (p > 0.05). 

PSCA expression In Pea 

In order to determine if PSCA protein and mRNA can be 
detected in prostate cancers and if PSCA expression levels 
are increased in malignant compared with benign glands, 
Forty-eight paraffin-embedded Pea specimens were ana- 
lysed by IHC and ISH. It was shown that 19 of 48 (39.6%) 
Pea samples stained very strongly for PSCA protein and 
mRNA with a score of 9 and another 21 (43.8%) speci- 
mens displayed moderate staining with scores of 4-6 (Fig- 
ure 1). In addition, 4 specimens with moderate to strong 
PSCA mRNA expression (scores of 4-9) had weak protein 
staining (a score of 2) by IHC analyses. Overall, Pea 
expressed a significantly higher level of PSCA protein and 
mRNA than any other specimen category in this study (p 
< 0.05, compared with BPH and PIN respectively). The 
result demonstrates that PSCA protein and mRNA are 
overexpressed by a majority of human Pea. 

Correlation of PSCA expression with Gleason score in Pea 

Using the semi-quantitative scoring method as described 
in Materials and Methods, we compared the expression 
level of PSCA protein and mRNA with Gleason grade of 
Pea, as shown in Table 1. Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentia- 
tion, 5-7 scores = moderate-differentiation and 8-10 
scores » poor-differentiation [7|. Seventy-two percent of 
Gleason scores 8-10 prostate cancers had very strong 
staining of PSCA compared to 21% with Gleason scores 
5-7 and 17% with 2-4 respectively, demonstrating that 
poorly differentiated Pea had significantly stronger 
expression of PSCA protein and mRNA than moderately 
and well differentiated tumors (p < 0.05). As depicted in 
Figure 1, IHC and ISH analyses showed that PSCA protein 
and mRNA expression in several cases of poorly differen- 



tiated Pea were particularly prominent, with more intense 
and uniform staining. The results indicate mat PSCA 
expression increases significantly with higher tumor grade 
in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 

With regards to PSCA expression in every stage of Pea, we 
showed the results in Table 2. Seventy-five percent of 
locally advanced and node positive cancers (i.e. C-D 
stages) expressed statistically high levels of PSCA versus 
32.5% that were organ confined (i.e. A-B stages) (p < 
0.05). The data demonstrate that PSCA expression 
increases significantly with advanced tumor stage in 
human Pea. 

Correlation of PSCA expression with androgen- 
independent progression of Pea 

All 9 specimens of androgen-independent prostate can- 
cers stained positive for PSCA protein and mRNA. Eight 
specimens were obtained from patients managed prior to 
androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the 
strongest staining category (score = 9), compared with 
three out of eight (37.5%) of patients with androgen- 
dependent cancers (p < 0.05). The results demonstrate 
that PSCA expression increases significantly with progres- 
sion to androgen-independence of human Pea. 

It is evident from the results above that within a majority 
of human prostate cancers the level of PSCA protein and 
mRNA expression correlates significantly with increasing 
grade, worsening stage and progression to androgen-inde- 
pendence. 

Correlation of PSCA immunostainlng and mRNA in situ 
hybridization 

In all 88 specimens surveyed herein, we compared the 
results of PSCA IHC staining with mRNA ISH analysis. 
Positive staining areas and its intensity and density scores 
evaluated by IHC were identical to those seen by ISH in 79 
of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/ 
48 Pea respectively). Importantly, 27/27 samples with 
PSCA mRNA composite scores of 0-2, 32/36 samples 
with scores of 3-6 and 22/24 samples with a score of 9 
also had PSCA protein expression scores of 0-2, 3-6 and 
9 respectively. However, in 5 samples with PSCA mRNA 
overall scores of 3-6 and in 2 with scores of 9 there were 
less or negative PSCA protein expression (i.e. scores of 0- 
4), suggesting that this may reflect posttranscriptional 
modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data 
demonstrate that the results of PSCA immunostaining 
were consistent with those of mRNA ISH analysis, show- 
ing a high degree of correlation between PSCA protein 
and mRNA expression. 
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Figure I 

Representatives of PSCA IHC and ISH staining in Pea (A. IHC staining, B. ISH staining, *200 magnification). A,, B,: negative con- 
trol of IHC and ISH. PBS replacing the primary antibody (A,) and hybridization with a sense PSCA probe (B,) showed no back- 
ground staining. A 2 . Bj: a moderately differentiated Pea (Gleason score = 3+3 = 6) with moderate staining (composite score = 
6) In all malignant cells; A 2 : IHC shows not only cell surface but also apparent cytoplasmic staining of PSCA protein. A 3 , B 3 : a 
poorly differentiated Pea (Gleason score = 4+4 = 8) with very strong staining (composite score = 9) in all malignant cells. 



Page 5 of 7 

(page number not for citation purposes) 



World Journal of Surgical Oncology 2004, 2 



http7Avww.wjso.eom/content/2/1/1 3 



Discussion 

PSCA is homologous to a group of cell surface proteins 
that mark the earliest phase of hematopoietic develop- 
ment. PSCA mRNA expression is prostatenspedfic in nor- 
mal male tissues and is highly up-regulated in both 
androgen-dependent and-independent Pea xenografts 
(IAPC-4 tumors). We hypothesize that PSCA may play a 
role in Pea tumorigenesis and progression, and may serve 
as a target for Pea diagnosis and treatment. In this study, 
IHC and ISH showed that in general there were weak or 
absent PSCA protein and mRNA expression in BPH and 
low grade PIN tissues. However, PSCA protein and mRNA 
are widely expressed in HGPIN, the putative precursor of 
invasive Pea, suggesting that up-regulation of PSCA is an 
early event in prostate carcinogenesis. Recently, Reiter RE 
etal [1], using ISH analysis, reported that 97 of 1 18 (82%) 
HGPIN specimens stained strongly positive for PSCA 
mRNA. A very similar finding was seen on mouse PSCA 
(mPSCA) expression in mouse HGPIN tissues by Tran C. 
P et al [8]. These data suggest that PSCA may be a new 
marker associated with transformation of prostate cells 
and tumorigenesis. 

We observed that PSCA protein and mRNA are highly 
expressed in a large percentage of human prostate cancers, 
including advanced, poorly differentiated, androgen- 
independent and metastatic cases. Fluorescence-activated 
cell sorting and confocal/ immunofluorescent studies 
demonstrated cell surface expression of PSCA protein in 
Pea cells [9). Our IHC expression analysis of PSCA shows 
not only cell surface but also apparent cytoplasmic stain- 
ing of PSCA protein in Pea specimens (Figure 1). One pos- 
sible explanation for this is that anti-PSCA antibody can 
recognize PSCA peptide precursors that reside in the cyto- 
plasm. Also, it is possible mat the positive staining that 
appears in the cytoplasm is actually from the overlying 
cell membrane [5]. These data seem to indicate that PSCA 
is a novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression 
conelates with high grade (i.e. poor differentiation), 
increased tumor stage and progression to androgen-inde- 
pendence of Pea. These findings support the original IHC 
analyses by Gu Zet al [9], who reported that PSCA protein 
expressed in 94% of primary Pea and the intensity of 
PSCA protein expression increased with tumor grade, 
stage and progression to androgen-independence. Our 
results also collaborate the recent work of Han KR et al 
[10], in which the significant association between high 
PSCA expression and adverse prognostic features such as 
high Gleason score, seminal vesicle invasion and capsular 
involvement in Pea was found. It is suggested that PSCA 
overexpression may be an adverse predictor for recur- 
rence, clinical progression or survival of Pea. Hara H et ai 
|11] used RT-PCR detection of PSA, PSMA and PSCA in 1 



ml of peripheral blood to evaluate Pea patients with poor 
prognosis. The results showed mat among 58 PCa 
patients, each PCR indicated the prognostic value in the 
hierarchy of PSCA>PSA>PSMA RT-PCR, and extraprostatic 
cases with positive PSCA PCR indicated lower disease-pro- 
gression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be used as a prognostic fac- 
tor. Dubey P et al |12] reported that elevated numbers of 
PSCA + cells correlate positively with the onset and devel- 
opment of prostate carcinoma over a long time span in 
the prostates of the TRAMP and PTEN +/- models com- 
pared with its normal prostates. Taken together with our 
present findings, in which PSCA is overexpressed from 
HGPIN to almost frank carcinoma, it is reasonable and 
possible to use increased PSCA expression level or 
increased numbers of PSCA-positive cells in the prostate 
samples as a prognostic marker to predict the potential 
onset of this cancer. These data raise the possibility that 
PSCA may have diagnostic utility or clinical prognostic 
value in human Pea. 

The cause of PSCA overexpression in Pea is not known. 
One possible mechanism is that it may result from PSCA 
gene amplification. In humans, PSCA is located on chro- 
mosome 8q24.2 (1], which is often amplified in meta- 
static and recurrent Pea and considered to indicate a poor 
prognosis [13-15]. Interestingly, PSCA is in close proxim- 
ity to the c-myc oncogene, which is amplified in >20% of 
recurrent and metastatic prostate cancers [16,17]. Reiter 
RE et al [ 18] reported that PSCA and MYC gene copy num- 
bers were co-amplified in 25% of tumors (five out of 
twenty), demonstrating that PSCA overexpression is asso- 
ciated with PSCA and MYC coamplitlcation in Pea. Gu Z 
et al [9] recently reporteted that in 102 specimens availa- 
ble to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had iden- 
tically positive areas of PSCA protein and mRNA expres- 
sion. Taken together with our findings, in which we 
detected moderate to strong expression of PSCA protein 
and mRNA in 34 of 40 (85%) Pea specimens examined 
simultaneously by IHC and ISH analyses, it is demon- 
strated that PSCA protein and mRNA overexpressed in 
human Pea, and that the increased protein level of PSCA 
was resulted from the upregulated transcription of its 
mRNA. 

At present, the regulation mechanisms of human PSCA 
expression and its biological function are yet to be eluci- 
dated. PSCA expression may be regulated by multiple fac- 
tors [18). WatabeTetal [19] reported that transcriptional 
control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be 
regulated or mediated through cell-cell contact and pro- 
tein kinase C (PKC) [20]. Homologues of PSCA have 
diverse activities, and have themselves been involved in 
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carcinogenesis. Signalling through SCA-2 has been dem- 
onstrated to prevent apoptosis in immature thymocytes 
[21]. Thy-1 is involved in T cell activation and txansducts 
signals through src-like tyrosine kinases [22]. Ly-6 genes 
have been implicated both in tumorigenesis and in cell- 
cell adhesion (23-25). Cell-cell or cell-matrix interaction is 
critical for local tumor growth and spread to distal sites. 
From its restricted expression in basal cells of normal 
prostate and its homology to SCA-2, PSCA may play a role 
in stem/progenitor cell function, such as self-renewal (i.e. 
anti-apoptosis) and/or proliferation [1], Taken together 
with the results in the present study, we speculate that 
PSCA may play a role in tumorigenesis and clinical pro- 
gression of Pea through affecting cell transformation and 
proliferation. From our results, it is also suggested that 
PSCA as a new cell surface antigen may have a number of 
potential uses in the diagnosis, therapy and clinical prog- 
nosis of human Pea. PSCA overexpression in prostate 
biopsies could be used to identify patients at high risk to 
develop recurrent or metastatic disease, and to discrimi- 
nate cancers from normal glands in prostatectomy sam- 
ples. Similarly, the detection of PSCA-overexpressing cells 
in bone marrow or peripheral blood may identify and pre- 
dict metastatic progression better than current assays, 
which identify only PSA-positive or PSMA-positive pros- 
tate cells. 

In summary, we have shown in this study that PSCA pro- 
tein and mRNA are maintained in expression from 
HGPIN through all stages of Pea in a majority of cases, 
which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell dif- 
ferentiation), advanced stage and androgen-independent 
progression. PSCA protein overexpression is due to the 
upregulation of its mRNA transcription. The results sug- 
gest mat PSCA may be a promising molecular marker for 
the clinical prognosis of human Pea and a valuable target 
for diagnosis and therapy of this tumor. 
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Abstract 

Translation Initiation is regulated In response to 
nutrient availability and mrtogonlc stimulation and Is 
coupled with ceD cycle progression and cell growth. 
Several alterations In translations! control occur In 
cancer. Variant mRNA sequences can alter the 
transiatlonal efficiency of Individual mRNA molecules, 
which In turn play a role In cancer biology. Changes In 
the expression or availability of components of the 
transnational machinery and in the activation of 
translation through signal transduction pathways can 
lead to more global changes, such as an Increase in 
the overall rate of protein synthesis and translations! 
activation of the mRNA molecules Involved In cell 
growth and proliferation. We review the basic 
principles of translations! control, the alterations 
encountered In cancer, and selected therapies 
targeting translation Initiation to help elucidate new 
therapeutic avenues. 

Introduction 

The fundamental principle of molecular therapeutics In can- 
cer is to exploit the differences In gene expression between 
cancer cells and normal cells. With the advent of cDNA array 
technology, most efforts have concentrated on identifying 
differences In gene expression at the level of mRNA, which 
can be attributable either to DNA amplification or to differ- 
ences in transcription. Gene expression is quite complicated, 
however, and Is also regulated at the level of mRNA stability, 
mRNA translation, and protein stability. 

The power of translational regulation has been best recog- 
nized among developmental biologists, because transcription 
does not occur In earfy ernbryogenesls in eukaryotes. For ex- 
ample. In Xenopus, the period of transcriptional quiescence 
continues unta the embryo reaches mldblastula transition, the 
4000-celi staga Therefore, all necessary mRNA molecules are 
transcribed during oogenesis and stockpiled In a translationally 
Inactive, masked form. The mRNA are transIationaOy activated 
at appropriate times during oocyte maturation, fertilization, and 



early embryogenesls and thus, are under strict translations! 
control. 

Translation has an established role In cell growth. Basi- 
cally, an Increase In protein synthesis occurs as a conse- 
quence of mitogenesls. Until recently, however, little was 
known about the alterations in mRNA translation in cancer, 
said much is yet to be discovered about their rote In the 
development and progression of cancer. Here we review the 
basic principles of transitional control, the alterations en- 
countered In cancer, and selected therapies targeting transla- 
tion initiation to elucidate potential new therapeutic avenues. 

Basic Principles of Translations! Control 
Mechanism of Translation Initiation 
Translation Initiation is the main step in translators! regulation. 
Translation Initiation Is a complex process in whta the InJtJator 
tRNA and the 40$ and 60S r&osomal subunte are recruited to 
the 5' end of a mRNA molecule and assembled by eukaryotic 
translation initiation factors Into an 80S rtbosome at the start 
codon of the mRNA (Rg. 1)i The 5' end of eukaryotic mRNA Is 
capped, Le., contains the cap structure m 7 GpppN (/-vneHhyX' 
guanostoe-trzpfK>spr^ Most translation In 

eukaryotes occurs In a cap-dependent fashion, /.a, the cap Is 
specifically recognized by the el F4E, a which binds the 5' cap. 
The elF4F translation initiation complex Is then formed by the 
assembly of elF4E, the RNA heDcase eiF4A, and elF4G, a 
scaffolding protein that mediates the binding of the 40S ribo- 
sorrral subunK to the mRNA molecule through Interaction with 
theelF3 protein present on the 40S ribosoma elF4Aand eiF4B 
participate in melting the secondary structure of the 5' LTTR of 
the mRNA. The 43S Initiation complex (40S/elF2/MeHRNA/ 
GTP compter scans the mRNA in a 5'-*3' direction until it 
encounters an AUG start codon. This start codon Is then base- 
paired to the antlcodon of initiator tRNA, forming the 48$ initi- 
ation complex. The Initiation factors are then displaced from the 
48S complex, and the 60S ribosome joins to form the 80S 
ribosoma 

Unlike most eukaryotic translation, translation initiation of 
certain mRNAs, such as the plcomavinjs RNA, Is cap inde- 
pendent and occurs by internal ribosome entry. This mecha- 
nism does not require eiF4£. Either the 43S complex can bind 
the initiation codon directiy through interaction with the IRES In 
the 5' UTR such as In the encephalomyocarditis virus, or it can 
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Initially attach to the IRES and then reach the initiation codon by 
scanning or transfer, as Is the case with the pollovirus (1). 

Regulation of Translation initiation 
Translation Initiation can be regulated by alterations In the 
expression or phosphorylation status of the various factors 
involved. Key components In translations regulation that 
may provide potential therapeutic targets follow. 

e!F4E elF4E plays a central role in translation regulation. 
It Is the least abundant of the initiation factors and is con- 
sidered the rate-limiting component for initiation of cap. 
dependent translation. elF4E may also be Involved In mRNA 
splicing, mRNA 3' processing, and mRNA nucfeocytopfas- 
mlc transport (2). eIF4E expression can be Increased at the 
transcriptional level In response to serum or growth factors 
(3), elF4E overexpression may cause preferential translation 
of mRNAs containing excessive secondary structure in their 
5' UTR that are normally discriminated against by the trans- 



latlonal machinery and thus are Inefficiently translated (4-7). 
As examples of this, overexpression of eIF4E promotes In- 
creased translation of vascular endothelial growth factor 
fibroblast growth factor-2, and cyclln D1 (2, 8, 9). 

Another mechanism of control Is the regulation of elF4E 
phosphorylation. elF4E phosphorylation is mediated by the 
mftogen-activated protein Wnase-lnteracting kinase 1, which 
Is activated by the mtoogen-activated pathway activating 
extracellular signal-related kinases and the stress-activated 
pathway acting through p38 mitogen-actlvated protein ki- 
nase (10-13). Several mitogens, such as serum, platelet- 
derived growth factor, epidermal growth factor, Insulin, 
angiotensin II, src kinase overexpression, and ras over- 
expression, lead to eIF4E phosphorylation (14). The phos- 
phorylation status of elF4E Is usually correlated with the 
transitional rate and growth status of the cell; however; 
elF4E phosphorytetlon has also been observed In response 
to some cellular stresses when translatlonal rates actually 
decrease (15). Thus, further study Is needed to understand 
the effects of e!F4E phosphorylation on eIF4E activity. 

Another mechanism of regulation Is the alteration of elF4E 
availability by the binding of e!F4E to the elf=4&blhdlng pro- 
teins (4E-BP, also known as PHAS-J). 4E-BPs compete with 
eIF4G for a binding site In elF4E. The binding of elF4E to the 
best characterized elF4E-blndlng protein, 4E-BP1, is regu- 
iated by 4E-BP1 phosphorylatioa Hypopho^horylated 4E- 
BP1 binds to eIF4^ whereas 4E-BP1 fryrjerphc^prK>ry1ation 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived growth factor, hepatocyt© 
growth factor, nerve growth factor, Insulln-Mkeg*>wth factors 
I and 11, mterteuWn 3, granulocyte-macrophage colony-stim- 
ulating factor + steel factor, gastrin, and the adenovirus have 
all been reported to induce phosphorylation of 4E-BP1 and 
to decrease the ability of 4E-BP1 to bind e!F4E (15, 16). 
Conversely, deprivation of nutrients or growth factors results 
In 4E-BP1 dephosphorylatlon, an increase In elF4E binding, 
and a decrease In cap-dependent translation. 

p70 SB Kinase. Phosphorylation of ribosomal 40S protein 
S8 by S6K Is thought to play an Important role In translations! 
regulation. S6K mouse embryonic cells proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive Influence on ceil proliferation (17). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
dlg^yrimidine tract (5' TOP) found at the 5' UTR of ribosomal 
protein mRNAs and other mRNAs coding for components of 
the transiatlonal machinery. Phosphorylation of S6K is regu- 
laled in part based on the availability of nutrients (18, 19) and is 
stimulated by several growth factors, such as platelet-derived 
growth factor and fnsuMke growth factor I (20). 

elF2<* Phosphorylation. The binding of the Initiator tRNA 
to the small ribosomal unit Is mediated by translation Initia- 
tion factor eIF2. Prrephorylation of the o-subunlt of eIF2 
prevents formation of the elF2/GTP/Met-tRNA complex and 
inhibits global protein synthesis (21, 22). elF2a is phospho- 
ryfated under a variety of conditions, such as viral Infection, 
nutrient deprivation, heme deprivation, and apoptosls {22). 
efF2a Is phosphoryfated by heme-rogulated Inhibitor, nutrient^ 
regulated protein kinase, and the IFN^nduced, double- 
stranded RNA-activated protein kinase (PKR; Ret. 23). 
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The mTOR Signaling Pathway. The macrollde antibiotic 
rapamycin (Sfcaflmus; Wyetb-Ayerst Research, Collegevflle, 
PA) has been the subject of intensive study because it in- 
hibits signal transduction pathways Involved in T-cefl activa- 
tion- Hie raparrydn-sensfflve component of these pathways 
Is mTOR (also called FRAP or RAFT1). mTOR is the mam- 
malian homologue of the yeast TOR proteins that regulate Q t 
progression and translation In response to nutrient availabil- 
ity (24). mTOR Is a serine-threonine kinase that modulates 
translation Initiation by altering the phosphorylation status of 
4E-BP1 and S6K (Rg. 2; Ref. 25). 

4E-BP1 b phossprroryfated on multiple residues. mTOR phos- 
phorylates the Thr-37 and Thr*46 residues of 4E-BP1 In vhro 
(26); however, phosphorylation at these sites Is not associated 
with a loss of elF4E binding. Phosphorylation of Trr-37 and 
Thr-46 is required for subsequent phosphorylation at several 
COOHrtermlnal, serum-sensitive sites; a combination of these 
Phosphorylation events appears to be needed to Inhibit the 
b!ndingof4E-BP1 to e4F4E (2^. The product of the/CTMgene,. 
P38/MSK1 pathway, and protein kinase Or aJsopbyaretefn 
4E-BP1 phosphorylation (27-29). 

S6K and 4E-BP1 are also regulated, In part, by PI3K and Its 
downstream protein kinase Akt PTEN Is a phosphatase that 
negatively regulates PBK signaling. PTEN nuD cells have 
constitutfvaly active of Akt, with increased S6K activity and 
S8 phosphorylation (30). S6K activity Is Inhibited both by 
PI3K Inhibitors wortmannln and LY294002 and by mTOR 
Inhibitor rapamycin (24). Akt phosphorylates Ser-2448 in 
mTOR In vitro, and this site Is phosphorylated upon Akt 
activation fn vivo (31-33). Thus, mTOR Is regulated by the 
Pf3K/Akt pathway; however, this does riot appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
Independent of mTOR is controversial. 

Interestingly. mTOR autophosphorylatlon is blocked by wort- 
mannln but not by rapamycin (34). This seem^ hconstetency 
suggests that mTOR-resporelve regulation of 4E-BP1 and S6K 
activity occurs through a mechanism other than Intrinsic mTOR 
kinase activity. An alternate pathway for 4E-BP1 and S8K phos- 
phorylation by mTOR activity Is by the Inhibition of a phospha- 
tase. Treatment with caiyculin A, an Inhibitor of phosphatases 1 
and 2A, reduces rapamydrvlnduced dephosphorylation of 4E- 
BP1 and S6K by rapamycin (35). PP2A interacts with full-length 
S6K but not with a S6K mutant that is resistant to dephospho- 
rylation resulting from rapamycin. mTOR phosphorylates PP2A 
in vitro; however, how this process alters PP2A activity Is not 
known. These results are consistent with the nnxJel that phos- 
phorylation of a phosphatase by mTOR prevents dephospho- 
rylation of 4E-BP1 and S6K, and conversely, that nutrient dep- 
rivation and rapamycin block Inhibition of the phosphatase by 
mTOR. 

PoIyadenylatJon. Hie pofy(A) tall In eukaryotlc mRNA Is 
Important In enhancing translation Initiation and mRNA sta- 
bility. Polyadenylatfon plays a key role In regulating gene 
expression during oogenesis and early embryogenesls. 
Some mRNA that are translatlonally Inactive In the oocyte are 
polyadenylated concomitantly with transiationeJ activation In 
oocyte maturation, whereas other mRNAs that are transla- 
tlonally active during oogenesis are deadenytated and trans- 
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Fig. 2. Regulation of transition Inflation by signal transduction oath- 
ways. Signals via p38. extracoCuter dgnal-nateted kinase, PBKTand 
mTOR can aB activate translation Initiation, 



latlonaHy silenced (38-38). Thus, control of poly (A) tall syn- 
thesis Is an important regulatory step in gene expression. 
The 5' cap and poly(A) tan are thought to function synergist 
tically to regulate mRNA translatlonal efficiency (39, 40). 

RNA Packaging. Most RNA-blndlng proteins are assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translatJonal fate of the transcript (41). A highly 
conserved family of Y-box proteins Is found In cytoplasmic 
messenger ribonucleoproteln particles, where the proteins 
are thought to play a role In restricting the recruitment of 
mRNA to the translatJonal machinery (41-43). The major 
mRNA-assoclated protein, YB-1 , destabilizes the Interaction 
of elF4E and the 5 r mRNA cap in vitro, and overexpresslon of 
YB-1 results In translatlonal repression in vivo (44). Thus, 
alterations In RNA packaging can also play an Important role 
In translations! regulation. 

Translation Alterations Encountered in Cancer 

Three main alterations at the translations^ level occur In cancer 
variations In mRNA sequences that increase or decrease trans* 
latJonal efficiency, changes In the expression or availability of 
components of the translatlonal machinery, and activation of 
translation through aberrantly activated signal transduction 
pathways. The first alteration affects the translation of an indi- 
vidual mRNA that may play a role In carcinogenesis. "The sec- 
ond and third alterations can lead to more gjobal changes, such 
as an increase In the overaH rats of protein synthesis, and the 
translations activation of several mRNA species. 

Variations fn mRNA Sequence 
Variations In mRNA sequence affect the translatlonal effi- 
ciency of the transcript A brief description of these variations 
and examples of each mechanism follow. 

Mutations. Mutations in the mRNA sequence, especially 
In the 5' UTR, can alter Its translatlonal efficiency, as seen In 
the following examples. 



otryc Salto et a/, proposed that translation of full-length 
omyc Is repressed, whereas In several Buridtt lymphomas 
that have deletions of the mRNA 5' UTR, translation of wipe 
is more efficient (45). More recently, It was reported that the 
5' UTR of omyc contains an IRES, and thus o-myc transla- 
tion can be Initiated by a cap-Independent as well as a 
cap-dependent mechanism (48, 47). In patients with multiple 
myeloma, a C-VT mutation In the omyc IRES was Identified 
(46) and (bund to cause an enhanced Initiation of translation 
via Internal ribospmai entry (49). 

BRCA1. Asomatlc point mutation (117 G-»C) In position 
-3 with respect to the start codon of the BRCA1 gene was 
Identified In a highly aggressive sporadic breast cancer (50). 
Chimeric constructs consisting of the wild-type or mutated 
BRGA1 5' UTR and a downstream luciferase reporter dem- 
onstrated a decrease in thetranslational efficiency with the 5' 
UTR mutation. 

CycBn-dependent Kinase inhibitor 2A. Some Inherited 
melanoma kindreds have a G— >T transverslon at base -34 
of cycfin-dependent kinase Inhibitor-aA, which encodes a 
cycHrvdependent kinase 4/cyclin-dependent kinase 6 kinase 
Inhibitor Important In O, checkpoint regulation (51). This 
mutatkMvghres rise to a novel AUG translation Initiation 
codon, creating an upstream open reading frame that com- 
petes for scanning ribosomes and decreases translation 
from the wild-type AUG. 

Alternate Splicing and Alternate Transcription Start 
Sites. Alterations In splicing and alternate transcription sites 
can lead to variations in 5' UTR sequence, length, and second- 
ary structure, ultimately Impacting translations! efficiency. 

A7M. The ATM gene has four nonooding exons In its 5' 
UTR that undergo extensive alternative spacing (52). The 
contents of 12 different S' UTRs that show considerable 
diversity in length and sequence have been identified. These 
divergent 5' leader sequences play an important role In the 
translation al regulation of the ATM gene. 

mdru. In a subset of tumors, overexpresslon of the onco- 
protein mdm2 results In enhanced translation of the mdm2 
mRNA. Use of different promoters leads to two mdm2 tran- 
scripts that differ only fn their 5' leaders (53). The longer 5' 
UTR contains two upstream open reading frames, and this 
mRNA Is loaded with ribosomes Inefficiently compared with 
the short 5' UTR. 

BRCA1. In a normal mammary gland, BRCA1 mRNA Is 
expressed with a shorter leader sequence (5'UTRa), whereas 
In sporadic breast cancer tissue, BRCA1 mRNA Is expressed 
with a longer leader sequence (5 r UTRb); the transnational 
efficiency of transcripts containing 5' UTRb Is 10 times lower 
than that of transcripts containing 5' UTRa (54). 

TGF-03. TGF-f& mRNA includes a 1.1 -kb 5' UTR, which 
exerts an Inhibitory effect on translation. Many human breast 
cancer cefl fines contain a novel TGF-03 transcript with a5 p 
UTR that Is 870 nucleotides shorter and has a 7-fold greater 
transnational efficiency than the norma) TOF-/33 mRNA (55). 

Alternate Polyadenylation Sites. Multiple polyadenyl- 
atlon signals leading to the generation of several transcripts 
with differing 3' UTR have been described for several mRNA 
species, such as the RET proto-oncogene (56), ATM gene 
(52), tissue Inhibitor pf metalloprotelnases-3 p7), RHOA 



proto-oncogene (58), and calmodulin-! (59). Although the 
effect of these alternate 3' UTRs on translation Is not y&t 
known, they may be Important In RNA-protein interactions 
that affect translations! recruitment The role of these alter- 
ations In cancer development and progression Is unknown. 



Alterations in tho Components of the 
Translation Machinery 

Alterations In the components of translation machinery can 
take many forms. 

Overexpresssion of elF4E Overexpresslon of elF4E 
causes malignant transformation in rodent ceils (60) and the 
deregulation of HeLa cell growth (61). Potunovsky ef a/. (62) 
found that e!F4E overexpresslon substitutes for serum and 
Individual growth factors In preserving viability of fibroblasts, 
which suggests that elRE can mediate both proliferative and 
survival signaling. 

Elevated levels of eIF4E mRNA have been found in a broad 
spectrum of transformed cefl lines (63). e!F4E levels am 
elevated in all ductal carcinoma in situ specimens and Inva- 
sive ductal carcinomas, compared with benign breast spec- 
imens evaluated with Western blot analysis (64, 65). Prelim- 
inary studies suggest that this overexpresslon Is attributa^ 
to gene amplification (66). 

There are accumulating data suggesting that eJF4E overex- 
pression can be valuable a3 a prognostic marker. elF4E over- 
expression v^fcwd In a reti^^ 

poor prognosis In stages I to in breast carcinoma (67). Verifica- 
tion of the prognostic value of elF4E in breast cancer Is now 
under way In a prospective trial (67). However, In a different 
study, elF4E expression was correlated with the aggressive 
behavior of non4lodgWn , s lymphomas (68). in a prospective 
analysis of patients wHh head and neck cancer, elevated levels 
of elF4E In histologically tumor-free surgical margins predicted 
a slgnfflcantiy increased risk of locakegtonal recurrence (9). 
These results aS suggest that e!F4E overexpresslon can be 
used to select patients who might benefit from more aggressive 
systemic therapy. Furthermore, the head and neck cancer data 
suggest that e!F4E overexpresslon is a field defect and can be 
used to guide local therapy. 

Alterations in Other Initiation Factors. Alterations In a 
number of other Initiation factors have been associated with 
cancer. Overproduction of elF4Q, similar to elF4E, leads to 
malignant transformation in vitro (69). e!F-2a Is found in 
increased levels In bronchloloalveolar carcinomas of the lung 
(3). Initiation factor elF-4A1 is overaxpressed in melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunlt of 
translation Initiation factor 3 is amplified and overaxpressed 
In breast and prostate cancer (72), and the elF3-p1 10 subunft 
Is overaxpressed in testicular seminoma (73). The role that 
overexpresslon of these initiation factors plays on the devel- 
opment and progression of cancer, If any, is not known. 

Overexpresslon of S6K. S6K is amplified and highly 
overaxpressed In the MCF7 breast cancer cell line, com- 
pared with normal mammary epithelium (74). In a study by 
Bariund et a/. (74), S6K was amplified in 59 of 668 primary 
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 
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Overexpresslon of PAP. PAP catalyzes 3' poty(A) eyn- 
thesis PAP b overexpressed In human cancer ceils com- 
pared with normal and virally transformed cells (75). PAP 
enzymatic activity to breast tumors has been correlated with 
PAP protein levels (76) and, In mammary tumor cytosols, was 
found to bo an independent factor for predicting survival (76). 
Uttle Is known, however, about how PAP expression or ac- 
tivity affects the translations! profile. 

Alterations In RNA-Wnding Proteins. Even less Is known 
about alterations In RNA packaging In cancer. Increased ex- 
pression aid nuclear localization of the RNA-WndJng protein 
YB-1 are focfcators of a poor prognosis for breast cancar (77), 
rm-smaflc^Jungcanc^ 

ever, this effect may be mediated at least In part at the level of 
transcription, because YB-1 lrweasescr«moreslstanc»by^ 
handng the transcription of a multidrug resistance gene (80), 



Activation of Signal Transduction Pathways 

Activation of signal transduction pathways by loss of tumor 

suppressor genes croverexpresslonc^ 

can contribute to the growth and aggressiveness of tumors. An 

Importan^ rnutant In human cancers Is the tumor suppressor 

ger»P7BV,wWchtea^ 

way. Activation of PI3K and Akt Induces the oncogenic trans- 
formation of cHdcen embryo fOxx^ 
srrawcorstauthrepr«x^^ 

A mutant Akt that retains kinase activity but does not phos- 
phorylat8S6Kor4&BP1 does not transform fibroblasts, which 
suggests a correlation between the oncogenicity of P13K and 
Akt and the pricsphorylation of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, fnsulMke growth factor, HER2/heu, and epidermal 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathways known to alter translation Initiation, activation 
of translation Is likely to contribute to the growth and aggres- 
siveness of these tumors. Furthermore, the mRNA for many 
of these kinases themselves are under translations! control. 
For example, HER2/neu mRNA is trartslatlonally controlled 
both by a short upstream open reading frame that represses 
HER2/heu translation In a cell type-Independent manner and 
by a distinct cell type-dependent mechanism that increases 
translations! efficiency (82). HER2/neu translation Is different 
in transformed and normal ceils. Thus, It Is possible that 
alterations at the transiationai level can In part account for 
the discrepancy between HEft2/neu gene amplification de- 
tected by fluorescence In situ hybridization and protein levels 
detected by Immunohlstochemfoal assays. 



Translation Targets of Selected Cancer Therapy 
Components of the translation machinery and signal path- 
ways Involved In the activation of translation initiation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycin 
and Tumstatfn 

Rapamycin Inhibits the proliferation of lymphocytes. It was 
Initially developed as an Immunosuppressive drug for organ 



transplantation. Rapamycin with FKBP 12 <FK508-bindIng 
protein, M r 12,000) binds to mTOR to inhibit Hs function. 

Rapamycin causes a small but significant reduction In the 
Initiation rate of protein synthesis (83). It blocks ceil growth In 
part by blocking S8 phosphorylation and selectively su£>- 
pressing the translation of 5' TCP mRNAs, such as ribosomal 
proteins, and elongation factors (83-85). Rapamycin also 
blocks 4E-BP1 phosphorylation and Inhibits cap-dependent 
but not cap-Independent translation (1 7, 86). 

The rapamydn-sensmve signal transduction pathway, acti- 
vated during malignant trartsformaiton and cancer progression, 
Is no w being studied as a target for cancer therapy (87). Pros- 
tate, breast smaB cefl hxstg, gBob!astoma > melanorna,andT-ce& 
leukemia are among the cancer Ones most sensitive to the 
rapamycin analogue CCI-779 (yVyeih-Ayerst Research; Reff. 
87).lnrh$bdonrrycosarcorra 

static or cytocidaJ, depending on the p53 status of the celt p53 
wOoMype ceib treated with rapamycin arrest in the Ch phase 
and maintain thefr vlabUity, whereas p53 mutant cefe accumu- 
lateinG, are! undergo apoptosis (B8, 89). In a recently reported 
study using human primitive neuroectodermal tumor and 
meduiiobfastoma models, rapamycin exhibited more cytotox- 
icity in combination with dspfetin and camptothedn than as a 
single agent In vivo, CCI-779 delayed growth of xenografts by 
160% after 1 week of therapy and 240% after 2 weeks. Aslngte 
high-dose administration caused a 37% decrease in tumor 
volume. Growth inhibition tn vivo was 1.3 times greater, with 
cisplatin In combination with CCi-779 than with ci3platfn atone 
00). Thus, preclinical studies suggest that rapamycin ana- 
logues are useful as single agents and In combination with 
chemotherapy. 

Rapamycin analogues CCI-779 and RAD001 (Novartte, 
Basel, Switzerland) are now In clinical trials. Because of the 
known effect of rapamycin on lymphocyte proliferation, a 
potential problem with rapamycin analogues is Immunosup- 
pression. However, although prolonged Immunosuppression 
can result from rapamycin and CCI-779 administered on 
continuous-dose schedules, the Immunosuppressive effects 
of rapamycin analogues resolve In -24 h after therapy 
(91). The principal toxicities of CCI-779 have included der- 
matologies) toxicity, myebsuppression, infection, mucositis, 
diarrhea, reversible elevations In liver function tests, hyper- 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase 11 trials of CCI-779 have been conducted in 
advanced renal cell carcinoma and in stage Ill/TV breast 
carcinoma patients who felted with prior chemotherapy. In 
the results reported In abstract form, although there were no 
complete responses, partial responses were documented In 
both renal cell carcinoma and in breast carcinoma 04, 95). 
Thus, CCI-779 has documented preliminary clinical activity hi 
a previously treated, undetected patient population. 

Active Investigation is under way into patient selection for 
mTOR inhibitors. Several studies have found an enhanced 
efficacy of CCI-779 In PTEN-null tumors (30, 96). Another 
study found that six of eight breast cancer cell lines were 
responsive to CCi-779, although only two of these lines 
lacked PTEN (97) There was, however, a positive correlation 
between AW activation and CCI-779 sensitivity (97). This 
correlation suggests that activation of the P»3K-Akt pathway, 



9T& IVanslatton tnmatton tn Cancer 



regardless of whether It fs attributable to a PTEN mutation or 
tooverexpresslen of receptor tyrosine kinases, makes carr- 
eer cell amenable to mTOR-dlrected therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with rapamycin resistance; thus, a lower 4E-BP1/elF4E ratio 
may predict rapamycln resistance (98). 

Another mode of activity for rapamycln and Its analogues 
appears to be through Inhibition of anglogenesls. This activ- 
ity may be both through direct Inhibition of endothelial cell 
proliferation as a result of mTOR Inhibition In these cells or by 
Inhibition of translation of such proanglogenlc factors as 
vascular endothelial growth factor In tumor cells (99, 100). 

The anglogenesls Inhibitor tumstatln, another anticancer 
drug currently under study, was also found recently to Inhibit 
translation In endothelial cells (101). Through a requisite In- 
teraction with integrin, tumstatln Inhibits activation of the 
PttWAto pathway and mTOR In endothelial cells and pre- 
vents dissociation of eIF4E from 4E-BP1, thereby Inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR-sfgnallng pathway. 



Targetafg e!F2ce EPA, Clotrimazole, mda-7, 
and Ftevonokis 

EPA Is an n-3 polyunsaturated fatty acid found In the fish- 
based diets of populations having a low Incidence of cancer 
(102). EPA inhibits the proliferation of cancer cells (103), as 
well as In animal models (104, 1 05). It blocks cell division by 
inhibiting translation Initiation (105). EPA releases Ca 2+ from 
Intracellular stores while Inhibiting thefr. refilling, thereby ac- 
tivating PKR. PKR, in turn phosphorylates and inhibits eIF2a, 
resulting In the Inhibition of protein synthesis at the level of 
translation Initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent In vitro and In vivo, Inhibits cell growth through 
depletion of Ca?+ stores, activation of PKR, and phospho- 
rylation of elF2a (106). Consequently, clotrimazole preferen- 
tially decreases the expression of cycllns A, E, and D1, 
resulting In blockage of the cell cycle In Q v 

mda-7 is a novel tumor suppressor gene being developed 
as a gene therapy agent Adenoviral' transfer of mda-7 (Ad- 
mda7) Induces apoptosis In many cancer cells Including 
breast, colorectal, and lung cancer (1 07-1 09). Ad-mda7 also 
Induces and activates PKR, which leads to phosphorylation 
of eIF2a and induction of apoptosis (1 10). 

Flavonolds such as genlstein and quercetin suppress tu- 
mor cefl growth. All three mammalian elF2a kinases, PKR, 
heme-regulated inhibitor, and PERK/PEK, are activated by 
flavonolds, with phosphorylation of elF2a and Inhibition of 
protein synthesis (111). 



Targeting elF4A and elF4E: Antisense RNA 
and Peptides 

Antisense expression of elF4A decreases the proliferation rate 
of melanoma eel Is (1 1 2). Sequestration of elF4E by overexpres- 
slon of 4E-BP1 Is proapoptotic and decreases tumorfgenlclty 
(113, 114). Reduction of elF4E with antisense RNA decreases 
soft agar growth, Increases tumor latency, and Increases the 
rates of tumor doubling times (7). Antisense elF4E RNA treat- 



ment also reduces the expression of angiogenic factors (115) 
and has been proposed as a potential acBuvant therapy for head 
aixJ neck careers, partai^ i n 
surgical margins. Small molecule hhlbifore that bhxi the eF4Q/ 
4E-BP1 -binding domain of elF4E are proapoptotic (116) and 
are also being actively pursued. 



Exploiting Selective Translation for Gene Therapy 
A different therapeutic approach that takes advantage of the 
enhanced cap-dependent transition In cancer ceBs Is the use 
of gene therapy vectors encoding suicide genes with highly 
structured 5' UTRTr*^ 

disadvantage In norma! cells and not transfateweH whereas In 
carc^c^theywoitfdtnmsto 

the introduction of the 5' UTR of fibroblast growth factor-2 5 r to 
the coding sequence of herpes simplex vtnsstype-1 thymkSn& 
kinase gene, allows for selective translation of herpes simplex 
virus (ype-7 thymidine kinase gene In breast cancer ceO lines 
compared with normal mammary ceO lines and resets in se- 
lective sensitivity to ganciclovir (1 1 7). 



Toward the Future 

Translation Is a croctel process In ever/ ce!L However, several 
alterations ki translationaJ control occur In cancer. Cancer celis 
appear to need an aberrantly activated transiatJonaJ state for 
survival, thus allowing the targeting of translation Initiation with 
surprisingly low toxicity. Components of the translations] ma- 
chinery, such as elF4E, and signal transduction pathways Irv 
volved In translation Initiation, such mTOR, represent promising 
targets for cancer therapy. Inhibitors of the mTOR have already 
shown some preliminary activity in clinical trials. It is possible 
that with the development of better predictive markers and 
better patient selection, response rates to singJe-agent therapy 
can be Improved. Similar to other cytostatic agents, however, 
mTOR inhibitors are most Skety to achieve cKnlcaJ utiity In 
combination therapy. In the Interim, our Increasing understand- 
ing of translation Initiation and signal transduction pathways 
promise to lead to the identification of new therapeutic targets 
In tiie near future. 



Acknowledgments 

Wo thank Gayfe Nesom from Tne University of Texas M. D. Anderson 
Cancer Center Department of Scientific Publications for editorial assist- 
ance and Dr. Oner Bemstam for assistance with manuscript preparation. 



References 

1. Pestova*T.V. f Kctopaeva,V.a^ 

1. M, Agol, V. I. and Heflen, C. U Molecular mechanisms of translation 
initiation in eukaryotes. Proa NaU Acad. Set USA, 88: 7029^7036, 2001. 

2. RosenwaW, 1 8 M Kaspar, R, Rousseau, Gehrke, U Lebouteh, P.. 
Chen, J. J., Schmidt E. V.. Sonenberg, M, and London, I. M. Eukaryotic 
translation Initiation factor 4E regulates expression of cycUn D1 at Iran* 
scriptlonal and post-transcrfptiona! levels. J. Biol. Cham., 2710: 21176- 
21180,1995. 

3. RosenwaW, 1. a, Hutter, M. J., Wang, S., Savas, L, and Fraire, A. E 
Expression of eukaryctfc translation initiation factors 4E and 2a b In- 
creased frequently in bronchloioalveoiar but not in squamous ceil carci- 
nomas of the lung. Cancer (PhDa.), 92: 2164-2171, 2001 . 



4. Dafveau,A^Pefteifer,0.a^ 

ft vitro translation of mouse c-myc transci^ffiffer^htheB'inrtiBns* 
fated ragkm. Proa Natl Acad. ScL USA, 82; 2315-2319, 1985. 
8. Kozak, M. Influences* of mRNA secondary structure on Wtiation by 
eukajyotte ribosomes. Proc NatL Acad. 8cL USA, 83: 2850-2854, 1988. 
a KorooiUas, A. E, Lazsrts-Karatzas, A., and Soounberg, N. mRNAs 
oontaWng extensive secondary structure in their 5' non-coding region 
translate efflcJerrtiy In ceSa overwpressing Initiation factor el F-4E EMBO 
J., 1 7; 4163-4158, 1992. 

7. Rlnker^chaeffer, C. W, Graff, J. R, D* Benedetti, A., Zrmrner, a G., and 
Rhoacs, RE Decreashg the level of translation initiation factor 4E wfth 
arrtteense RNA causes reversal of ras-medlated transformation and tumori- 
panesfc of doned rat embryo fibroblasts. Int J. Cancer, 5$: 841 -847, 1 993. 

8. Kev3, C. a, De Benedetti, A. Payne, D. rC, Cbe, L U Laroux, F. S., 
and Alexander, J. $. Translational regulation of vascular permeability 
factor by eukaryotJc initiation factor 4fc implications for tumor angtogen- 
esis. tnL J. Cancer, 65; 785-790, 1996. 

a Nathan, C. A, Franklin, S., Abreo, F. W.. Nassar, a, Oe Benedetti, A., 
and Glass, J. Analysis of surgical margins with the molecular marker 
elF4E a prognostic factor In patients with head and neck cancer. J. Cilrt. 
OncoL, 77; 2909-2914, 1999. 

10. Fukunaga, R., end Hunter, T. MNK1, a new MAP kinase-activated 
protein kinase, isolated by a novel expression screening method for 
Identifying protein kinase substrates. EMBO J„ 16; 1921-1933, 1997. 

11. Waskfewfcz, A J., Flynn, A, Proud, C. G., and Cooper, J. A. Mitogen- 
activated protein kinases activate the serine/threonine kinases Mnkl and 
Mnk2. EMBO J., 16: 1909-1920, 1997. 

12. Wang, X, Flynn, A, Wasktowtez, A J, Webb, B. L, Vrles, R a, 
Beines, L A, Cooper, J. A, and Proud, a a The phosphorylation of 
euxaryotio Initiation factor elF4E in response to phorboi esters, cell 
stresses, end cytokines to mediated by distinct MAP kinase pathways. 
JL Biol. Chem., 273: 9373-S377, 1998. 

13. Pyronnet, S., Imataka, R, Glngras, A C.. Fukunaga, R, Hunter, T., 
and Sonenberg." N. Human eukaryotic translation initiation factor 4Q 
(0lrMG)iecrute Mnkl toptoeprwryiateelRE EMBO J^ 78:270-279.1999. 

14. Wegn, M„ Scheper, G. C„ Voorma. H. 0. andThomas, A A Regulation 
of translation initiation factors by signal transduction. Eur. J. Bfochem. 253: 
531-544,1999. 

15. Raught, B., and Glngras, A C. elF4E activity is regulated at multiple 
levels. Int J. B tochem. Ceil BbL. 37; 43-57, 1 999. 

18. Takeucrti. rt, Shlbamoto, S., Nagamlno, Shlgemori, I., Omura, S., 
Kftamum, N., and too, F. Signaling pathways leading to transcription and 
translation cooperatively regulate the transient increase m expression of 
c-FOs protein. J. BfoL Chem., 276V 26077-260B3, 2001. 

17. Kawasome, H., Papst P., Webb, S;, Keller. G. M-, Johnson, a L. 
Geffand, E W., and Terada, N. Targeted disruption of p70(86k) defines Hs 
role In protein synthesis and raparoycln sensitivity. Proc. Natl Acad. Scl 
USA, 95: 5033-5038, 1998. 

18. Christie, a R.. Hafduch, E, Hundal, H. S., Proud, C. a, and Taylor, 
P. M. Intracellular sensing of amino acids In Xenopus teevfc oocytes 
stimulates p70 S8 kinase In a target of rapamycin-dependent manner, 
d. Biol Chem., 277; 9952-9957, 2002. 

19. Hara,K,Yaie2awa,IC.Wen^ 

Avnicn, J. Amino acid sufficiency and mTt)R regulate p70 S6 kinase and 
e!F-4E BP1 through a common effector mechanism. J. BfoL Cheia, 273; 
14484-14494, 1998. 

20. Graves, L M., Bomfeldt, K. E, Argast, G. M., Krebs. E a> Kong, X., 
Un, T. A, and Lawrence, J. C, Jr. cAMP- and raparnycln-sensruve reg- 
ulation of the association of eukaryotlc initiation factor 4E and the trans- 
lationa! regutetor PHAS-I In aortic smooth muscle cefla Proc Natf. Acad 
Scl USA, 92: 7222-7226, 1995. 

21. Merrlc*.W.C.,aiKf Hershey, J. W.B. The p 

eukaryotlc protein synthesis, im J. W. B. Hershey and M. a Mathews 
(eds.), Translational Control, pp. 31-69. Cold Spring Harbor, NY: Cold 
Spring Harbor Laboratory, 1998. 

22. Kimball, S. ft. Eukaryotio Initiation factor e!F2. Int J. B tochem. Cell 
BJoL,S/;25~2Q,189& 

23. Jagus, R, JosW, B„ and Barber. G, H. PKR, apoptosls end cancer. 
Int J. 8tochem. Ceil Biol., $1: 123-138. 1999. 



24. Thoma3,a,aixlHa*M.hlTCra8Jg^ 
Cuir. Opto. CeS BIoL. 9t 782-787, 1997. 

25. Ghgras, A C M Raught, B„ and S<xienberg, N. Regulation of transla- 
tion initiation by FRAP/mTOR. Genes Dev, 16V 807-826, 2001. 

26. Glngras, A C, Gygi, S. P„ Raught, B., PoJaWewicz. a Abraham 
a T^ Hoekstra, M. F., AebersoW, R, and Sonenberg, N. Regulation of 
4E-BP1 phosphorylation: a novel two-step mechanism. Genes Dev.. 73/ 
1422-1437,1999. 

27. Kumar, v., Pandey. P^ SabafinL D n Kumar, Majumdar, P. 
Bharti, A, Camjfehael, a, Kufe, and Keibteno^ a Fur«^orwl hterax^^ 
between RAFTi/FRAP/mTDR and protein kinase C8 fin the regulation of 
cap-dependent initiation of translation. EMBO J., 19: 1 087-1097, 2000. 

28. Yang, D. a , and Kastan, M . a Participation of ATM in insulin sign eu>- 
flng through phosphorylation of e1F-4E-blnding protein 1. Nat Cell Btol. 
2: 893-898, 2000. 

29. Uu, O, Zhang, Y n Bode, A M., Ma, W. Y„ and Dong, Z. Phospho- 
rylation of 4E-BP1 Is mediated by the p38/MSK1 pathway in response to 
UVB irradiation. J. BfoL Chem., 277: 8810-8816, 2002. 

30. Neshat. M. S., Memnghoff, I. K, Tran, C, StSes, B., Thomas, Q_ 
Petersen, a, Frost, P., Gfebons, J. J, Wu. H.. and Sawyera, a U En- 
hanced sensitivity of PTEN-deiteterrt tumors to Inhibition of FRAP/mTOF*. 
Proc Nati. Acad ScL USA, 9& 10314-10319; 2001. 

31. Sekuflc, A, Hudson, C. Homme, J. U Yin, P, Ottemesa, D. 
KamHz, L M^ and Abraham, a T. A direct Gnkage between the phosphor 
nosftjde3-Wnaso-AKr sigiisJ^g pathway and the mammato target of mpa- 
mydn fcn mtogen-stimuiated and transforrned coli3. Cancer Res- 60r 3504— 
3513,2000. 

32. Scott, P. H., and Lawrence, J. C, Jr. Attenuation of mammalian tar9et 
of rapamycm activity by Increased cAMP In 3T3-^1 adipocytes, J. BtoL 
Chem., 273? 34498^4501, 1998. 

33. Reynolds. L T., Bodine, a C^ and Lawrence, J. C. f Jr. Control of 
Ser2448 phosphorylation En the mammalian target of rapamycm by Insulin 
and skeletal muscle load. J. BfoL Chem., 277; 17657-17682, 2002. 

34. Peterson, a t, BeaJ, P. A. Comb, M. J., and Schreiber, S. L. 
FKBP12^apamycin-assocIated protein (FRAP) autophosphoryiates at 
serine 2481 under translationafly repressive conditions. J. Bioi. Chem.. 
275; 7416-7423, 2000. 

35. Peterson, a T„ Desal, a N., Hardwlck, J. S.. and Schreiber, S. L. 
Protem phosphatase 2A Interacts wito the 7(WcDaS81onase and la acti- 
vated by Inhibition of FKBP12-rapaniycm-a8socteted protein. Proc Natf. 
Acad. Scl. USA, 96: 4438-4442, 1999. 

36\ McGrew, L. U. DwWn-Rastl, E, Dworkm, M. B.. and Rtehter, J. D. 
Poly(A) elongation during Xenopus oocyte maturation is leo^lred for trans^ 
lational recruitment and Is meolated by a short sequence element Genes 
Oev^ 3L' 803-81 5, 1989. 

37. Sheets, M. D., Wu, M., and Wlokens, M. Poryadenyiatlon of c-mos 
mRNA as a control point In Xenopus meiotic maturation. Nature (bond.) 
374: 511-616, 1995. 

38. Vamum. S. M., and Wormlngton, W. M. Deadenylation of maternal 
mRNAs during Xenopus oocyte maturation does not require specific 
cfc-sequences: a default mechanism for translational control. Genes Dev. 
4:2278-2286,1990. 

39. Gaille, D. R. The cap and poly(A) tail function aynerglsticalty to regu- 
late mRNA translational efficiency. Genes Dev., ft 2108-2116, 1991. 

40. Sachs, A B., and Varani, G. Eukaryotic translation Initiation: there are 
(at least) two sides to every story. Nat Struct. BIoL, 7: 358-361, 2000. 

41. WqJfte, A. P., and Merle, F. Coupling transcription to translation: a 
novel site for the regulation of eukaryotic gene expression. Int J. Blo- 
chem. Cell Biol., 28: 247-257, 1996. 

42. BrtoWmova, V. M., WeL C. L, SHJkov. A Simortenko, P. 
Uzarev, O. A, Vasltenko, K. S. Ustinov, V. A, Hershey, J. W., and 
Ovchlnntkov, L P. The major protein of messenger ribonucteoproteh 
particles In somatic cells Is a member of the Y-box binding transcription 
factor family. J. BioL Chem^ 27ft 3186-3192. 1995. 

43. Matsumoto. K., Merto, F., and Woiffe, A P. Translational repression 
©^pendent on the IntenK^ of the Xenopus Y-box protein FRGY2 whn 
mRNA. Role of the cold shock domain, tail domain, and selective RNA 
sequence recognition. J. Blot. Chem,, 271: 22706-22712, 1996. 



979 Transition Ntiafoo h Cancer 



44. EwtoWmova Ruzanov, P., fmataka, R, Raught, B., Svitkbi, Y., 
OvcWnntoov. L P., and Sonenberg, N. the major rnRNA-associated pro- 
tein YB-t fs a potent 6 1 cap-dependent mRNA stabifeer. EMBO J. t 20: 
6491-5502,2001. 

4$. Saito. ft. Heyday ► A. C, Wlman, K., Hayward, W.S^ aridTonegawa, 

&Activetion<rfthecHrrycgen^ 

control Proa Natl. Acad 3d USA, SO: 7476-7480. 1 883. 

48. NardwC^Wbi^AuoW^ 

and Prate, K C. Afternatfve transiation of the proto-oncogene c-myc by an 
internal rtoosorne entry stta J. Biol Chem^ 272: 33)61-32066, 1997, 

47. Stonefey, M„ Paulln, F. E, le Quesne, J. P„ CnappeB, a A. and 
WOIIs, A E c-Myc 5' untranslated region contains an fntefnal rfbosome 
entry segment Oncogene. 16: 423-428, 1998. 

48. I^HF.E l West.M.^SuKwan I M.F, Wrmey. R L, Ijne, U and 
W3Bs, A E Aberrant transnational control of the ornyc gene in multiple 
myeloma. Oncogene, 13: 505-613, 1996. 

49. CtappeO, S. A^ UQuesne, J. P., PauQn, F. E, de Schoolmeester, 
M. L, Stoneley, Kt, Soutar, R L, Ralston, & HeBrtch, M. R, and Willis, 
A E A mutation In the o-rnyo-f RES leads to enhanced fntemal rfbosomo 
entry hmuftipta rnyetoma: a novel mechanism of oncogene de-regutation. 
Oncogene, 19: 4437-4440, 2000. 

50. Signed. E, BagnV C, Papa, S., Prlmerano, a. Rmakfl, M., Amakfl, F., 
and Fazio, V. M. A. A somatic mutation In the 5*UTR of BRCA1 gene In 
sporadic breast cancer causes down-modulation of translation efficiency. 
Oncogene, 20: 4596-4600, 2001. 

51. Uu. L, D&vorth, D., Gao, L, Monzon, J., Summers, A, Lassam, ft, and 
Hogg, a Mutation of the COKN2A 5' UTR creates an aberrant taxation 
codon and precfisposes to melanoma. Nat Genet, 21: 128-132, 199a 

62. SavSsky.lC, Platzer, M..Uztel.T., Gilad, S.. Sarttet, A^ Rosenthal, A-. 
Boy-Stem, O., SHIoh, Y„ and Rotman/a Ataxfa-tetangleotada: struc- 
tural oTversfty of untranslated sequences suggests complex post-tran- 
scriptiona! regulation of ATM gene expression. Nudete Acids Res., 25: 
1678-1684, 1997. 

63. Brown, C. Y„ Mtze, G. J., Pineda, M„ George, D. L, and Morris, D: R 
Role of two upstream open reading frames In the translation^ control of 
oncogene mdm2. Oncogene, 16: 5631-6637, 1999. 

54. Sobczak, iC, and Krzyzostek, W. J. Structural determinants of BRCA1 
transJatfonal regulation. J. Bfol. Chem.. 277: 17349-17358, 2002. 
65. Amok, B. A* Grendell, R L, and Griffin, L A. Enhanced translation^ 
efficiency of a rtove! transforming growth factor ^3 mRNA In human breast 
cancer cetb. Mol. CeB. Blot, 14: 619-628, 1994. 
58. Myers, S. M., Eng. C.. Ponder, B.A., and Mulligan, L M. Character- 
ization of RET proto-oncogene 3' splicing variants and poryadenytatlon 
sites: a novel C-termmus tor RET. Oncogene. 11: 2039-2045, 1995. 

57. Byrne, J. A., Tomasetto, C, Rouyer, N.. BeOocq, J. P., Rio. M. C.. and 
Basset, P. The tissue Inhibitor of metaUopmtefnases-3 gene in breast 
carcinoma: Identification of multiple polyadenylation sites and a stromal 
pattern of expression. Mol. Med., 1: 418-427. 1995. 

58. Moscow, J. A., He, R. Gudas, J. M., and Cowan, K. H. Utilization of 
multiple polyadenylatlon signals in the human RHQA protooncogene. 
Gene (Amst.), 144: 229-236, 1994. 

59. Senterre-Lesenfants, S., Alag, A. S.. and Sobel, M. E Multiple mRNA 
species are generated by alternate potyadenylation from the human ca/m- 
ocft/JM gene. J. CeiL Blochera, 68: 445-454, 1995. 

60. Lazarte-Karatzas, A., Montine, K. S-, and Sonenberg, N. Malignant 
transformation by a eukaryotic Initiation factor subunrt that binds to mRNA 
5' cap. Nature (Lend.), 345: 644-647, 1990. 

61. De Benedettl, A., and Rhoads, R E Overexpresslon of eukaryotic pro- 
te&i syrithesb Inflation factor 4E in Heiaceteresuttslnabenartffowmend 
morphology. Proa Natl. Acad Sd USA, 87; 8212-8216, 1990. 

62. Pohmovaky, V. A.. Rosenwald. I B„ Tan, A. T H White, J n Chiang, L, 
Sonenberg, N., and Bftterman, P. B. Translatlonal control of programmed 
cell death: eukaryotic translation initiation factor 4E blocks apoptosis In 
gftwtrMacter-restf cted fibroblasts with physiologically expressed or de- 
regulated Myo. Mol. CeB. Bkrf., 16: 6573-6581, 1998. 

63. Mlyagfc Y„ Sugryama, A., Asai, OkazaH, T„ KucWno, Y„ and Kerr, 
S.. J. Bevated levels of eukaryotte translation Initiation factor elF4E mRNA in 
a broad spectrum of transfdnr»edc*OI^C 



64. Kerekatte, V. SmDey. Hu, B.. Smith, A, GeWer, and Do 
Benedetti, A.TT»proto-oncogene/trar^ 

expression In breast cardnomaa tnt J. Cancer, 64: 27-31 ,1995. 

65. UB.D.,Uu,U Dawson, M., and De Benedettl, A. Overexpresslon c»f 
eukaryotic InWatkmfactw 

79:2385-2390. 1897. 

W. SorreBa,aUBJacKD.R,Meschonat,C^Rf^ 

Gao, M., WiBtems, R J, and U. R a Detection of eF4E gene anipfficatton 

m breast cancer by competitive PCR Ana Surg. OncoL, 5c 232^37, 199B. 

67. U, B. D., McOonaW, J. C, rtessar, R^ and De Benedettl, A. OlnicaJ 
outcome In stage t to HI breast caroaioma and eiF4H overexpresslon. Ann. 
Surg., 227: 756-781; discussion, 761-763, 1998. 

68. Wang, R, RosonwaW, I. R, Hutzler, M. J., PSian, a A, Savas, U. 
Chen, J. and Woda, R A. Expression of the eukaryotic transiation 
Inmation factors 4E and 2a In non-Hodgkln's lymphcirnas. Am. J. PamoL # 
T55r 247-^55, 1999. 

69. Fulojchl^tmogorf, T., tehfl, L. KasWwagJ, K., Mashlba, R, EWrnoto, 
H., and Igarashl, K. MaJlgnant transformation by overproduction of trans- 
lation initiation factor efF4R Cancer Res,, 67; 5041-5044, 1997. 

70. eberie, J., KrasagaWs, rC and Orfanos, C. E Transiation initiation 
factor elF-4A1 mRNA b constetentiy overexpressed In human melanoma 
cells in vitro. Int J . Cancer, 71: 396-401 , 1 997. 

71. Shuda, M^ Kondoh, H, Tanaka, K., Ryo, A* Waxatsuki, T^ Kada, A.. 
GoseH K* (gari T. t Hatsuse, IC, Alhara, T n Hortuchi, R, Shtehfta, M., 
Yemamoto, N., and Yamamoto, M. Enhanced express ion of translation factor 
mRNA3lnhepatoce0u4arcaicin^ 200O. 

72. Nupponen, M NL, Porkka, K., KaWcoIa, L, Tanner, M„ Persson, K., 
Borg. A., tsola, and Vlsakorpi, T. Amplication and overexpresslon of 
p40 subunit of eukaryotic translation Initiation factor 3 in breast and 
prostate cancer. Am. J. Pamot, 164: 1777-1783, 1999. 

73. Rome, M„ Kb, Albers, P H and Wemert, N. Eukaryotic inttiation 
factor 3 p110 mRNA Is overexpressed In testicular seminomas. Am. O. 
PaihoLp 1S7: 1597-1604, 2000. . 

74. Bartund, M., Fcrozan. F. ( Kononen, J., Bubendorf, U Chen, Y., Brtt- 
ner, M. I_ Torhorst, J-, Haas, P., Bucher, a, Sauter, G. r KalQonlamS, a P., 
and Kaluontaml. A. Detecting activation of rtbosomaJ protein S6 kinase by 
complementary DNA and tissue mlcroarray analysis. Jl Natl Cancer Inst 
(Bethesda), 92: 1252-1259, 2000. 

75. Topafian, S. L, Keneko, S., Gonzales, M. I, Bond, G. L, Ward, Y. v and 
Manley, X L. ktentrftcation and functional characterization of neo-ooty(A) 
polymerase, an RNA processing enzyme overexpressed In human tumors. 
MoL CeH BtoL, 21; 5614-6623. 2001. 

76. Sccn1as,A^TalIeri,M. J Aro^van^A.,Cou^ 

TsJapaHs, C M^ and Trangas, T. Pdyadenylate rx>lyrnerase enzymatic ac- 
tivity In mammary tumor cytosote: a new independent prognostic marker m 
primary breast cancer. Cancer Res., 60: 5427-5433, 2000. 

77. Janz,M.,HaAeck,N.,Dettm^ 

Schmitt,M., and Royer,RD.Y-box factor YB-1 predicts drug resistance and 
patient outcome in breast cancer independent of dlnlcaUy relevant tumor 
biologic factors HER2, uPA and PAH. Int J. Cancer. 97: 278-282, 2002. 

78. Shlbahara, Suglo. IC, OsaW. T., UcWum!, T., Maehara, Y., Kohno. 
tC, Yasumoto, IC Suglmacrd, K., and Kuwano, M. Nuclear expression of 
the Y-box brndeig protein, YB-1 , as a novel marker of disease progression 
In non-small ced lung cancer. CQn. Cancer Res.. 7: 3151-3155, 2001. 

79. Kamura, T., Yahata, R, Amada, S., Ogawa, R, Sonoda, T., Koba- 
yashl, H.. Mrtsumoto. M., Kohno, K.. Kuwano, M., and Nakano, R rs 
nuclear expression of Y box-blncfing protem-1 a new prognostic factor In 
ovarian serous adenocarcinoma? Cancer (Phi la.), 85; 2450-2454, 1999. 

80. Bargou. R. C., Jurchott, IC, Wagoner, C M Bergmann, S.. Metzner, S„ 
Bommert, K., Mapara, M. Y., Wlnzer, K. J.. Dtetel, M., Dorken, B. t and 
Roy or, H. D. Nuclear localization and hcreased tevete of transcription 
factor YB-1 In primary human breast cancers are associated with Intrinsic 
MDR1 gene expression. Nat Med., 3: 447-450, 1997. 

81. AoW. M.. Btezek, E. and Vogt, P. K. A role of the kinase mTOR In 
cellular transformation Induced by the oncoproteins P3kandAkt Proa 
Natl. Acad. Scl. USA, 93: 136-141, 2001. 

82. ChBd,S.J^ MIDer. M. K., and GebaJla, A. P. CeB type^iependem and 
independent control of HER-2/neu translation. Int J. Blochem. Cell Biol, 
31: 201-213, 1899. 



Molecular Cancer Therapeutics 



83. Jofferics. H.R, Relnjiard, C, Kozma, 8. C and Thomas, Q. Rapa- 
raycw selectively represses translation of the "poJypyrbnkfina tract" 
rtflNAfamOy. Proa NatL Acad. ScL USA, ST; 4441-4445, 1994, 

84. Tsrada, K, Patel, k R, Takase, It, Kohno, Jt, Malm, A. C, and 
Getfand, E W. Rapamycln selectively Inhibits translation of mRNAs en- 
coding elongation factors and ribosomal proteins. Proc. NatL Acad. ScL 
USA, 91: t1477-1l461. 1994. 

85. Jefferies, H. B., FumagalH, 3., Dennis, P. B n Relnhard, C, Pearson, 
R B. f and Thomas, a Rapamycln suppresses 5TOP mRNA transfatJon 
through InhMton of p70s6k. BABO J., 16: 3693-3704. 1997. 

88. Beretta, L, Glngras, A. C, SvftWn, Y. V. t HaB, M. N., and Sonenberg, 
N. Rapamycfn Nocks the phosphorylation of 4E-BP1 and Inhibits cap- 
dependent Inflation of translation. EMBO J., 15: 658-664, 1996. 
67. Hidalgo, M n and Rowfnsty, E It The rapamydn-seraitive signal 
transduction pathway as a target for cancer therapy- Oncogene, 19: 
6680-6686, 2000. 

88. Hosc*H,dfiing,M.B,Sha^ 

Germain, a &, Abraham, a T., and Houghton, P. J. Rapamycln causes 
poc^reveialbblnhlbflfan of roTDR and induces p53-independentapoptoslS' 
b human rhabciornycsaaxoma ceDs. Cancer Res., 69: 888-894, 1999. 

89. Huang, a, and Houghton, P. J. Resistance to rapamycln: a novel 
anticancer drug. Cancer Metastasis Rev., 20: 89-78, 2001. 

90* Gdoerger, R. Kerr, it, Tang, C- B., Fung, It MU PowaB, R, Sutton, 
L M, PhBHps, P. C, and Janss, A. J. Antitumor activity of the rapamycln 
analog OCt-779 In human primitive neuroectodermal tumor/medufioblas- 
toma models as single agent and In combination chemotherapy. Cancer 
Res,, 61; 1&7-t532. 2001. 

91. Gibbons, J. J., Diseafard, a, Peterson, R, Hernandez, R, SkotnJctd, 
J„ and Frost, P. The effect of CCl-779, a novel macrofide anti-tumor 
agent, on the growth of human tumor cells in vitro and in nude mouse 
xenografts hi vhx>. Proc. Am. Assoc Cancer Res., 40: 301, 1999. , 

92. Hidalgo, M„ Rowlnsky, E, Erilchman. C, Marshall, B., Marks, R, 
Eo\ranJs, T., and Buckner, J. J. A Phase I and pharmacological study of 
CCl-779 cycle Inhibitor. Ana Oncol, 11 (SuppL 4): 133, 2001. 

93. Alexandre, J., Raymond, E, Depenbrock, H,, Mekhakfi, S., Angevin, 
E, PaSet, C, Hanauske, A., Frisch, J. r Feussnar, A, and Amtand, J. P. 
CCl-779, a new rapamycln analog, has antitumor activity at doses Induc- 
ing only mad cutaneous effects and mucositis: earfy results of an ongoing 
Phase I study. Proceedings of the 1999 AACR-NCf-EOHTC international 
Conference, CHo. Cancer Reo, 5 (Suppl): 3730a, 1999. 

94. Chan, S., Johnston, Scheuten, M. &, Mrosa, It, Mcrant, A, Lahr, 
A, Feussnar, A, Berger, M„ and Kbsch, T. First report a Phase 2 study 
of the safety and activity of CCl-779 for patients with locally advanced or 
metastatio breast cancer falling prior chemotherapy. Proa Am. Soc. Clin. 
Oncol., 21: 44a 2002. 

95. AUdne, M. a, Hidalgo, M., Stadler, V7\, Logan, T., Dutcher, J. P., Hudes, 
GL, Park, Y„ Marshafl, B„ Bonf, J., and Dukart, G. A randomized double-blind 
Phase 2 study of Intravenous CCl-779 administered weekly to patients with 
advanced renal ceO carcinoma Proc. Am. Soc. Clin. Oncol, 21: 10a, 2002. 

96. Smith, S. G., Trinh, C. M., Inge, L J., Thomas, Q., Ctoughsey, T. F., 
Sawyers, C. U and Mlschel, P. S. PTEto expression status predicts 
glioblastoma cell sensttivtty to CCl-779. Proa Am. Assoc. Cancer Res., 
43: 335, 2002. 

97. Yu, ft, ToraKBarza, U Discafani, C„ Zhang, W. O. SkotnJcW, J., 
Frost, P., and Gibbons, J. J. mTOR, a novel target In breast cancer the 
effect of CCl-779, an mTOR inhibitor, in preclinical models of breast 
cancer. En doer. Ralat Cancer, & 249-288, 2001 . . 

98. Dating, M R, Germain, G. S., Dudkln, L, Jayaraman, A. L, Zhang, X, 
Harwood,F. C, and Houghton, P. J. 4E-bfnding proteins, the suppressors of 
eukaryotlo hftatkxi factor 4E, are do wnregulated in cetts wflh acquired or 
frrtnnsio resistance to rapamych. X Bid Chero, 277: 13907-13917, 2002. 

99. Guba, M., von Broitenbuch, P., Stelnbauer, M., Koehl, O, Beget, S., 
Homung, M„ Bruns, C. J.. Zuelke, C, Farkas, $., Anthuber, M., Jauch, 
K. W., and Getssler. E It Rapamycln Inhibits primary and metastatio 
tumor growth by antlangiogenesls: Involvement of vascular endothelial 
growth factor. Nat Med., &• 128-135, 2002. 



100. Una, H. A., Schafi, Theuer. A., O'ReHry, and Wood, J. Ardi- 
angtogenJc activity of RAD001, an orally active anticancer agent Proc, 
Am. Assoc Cancer Res., 43: 184, 2002. 

101. Maeshbna, Y., Sudhakar^ A* Uvely, J. C, UeW, It, Kharbanda, 
Kahn, a R, Sonenberg, Hynes, R and Kalhtft R "Iumstatin,'ajn 
endothelial cefl-speeffle Inhibitor of protein synthesis. Science (Wash. DC), 
285:140-143,2002. 

102. CaygDl, C. P., Chartett, A^ and Hill, M. J. Fat, fish, fish oil and cancer. 
Br. J. Cancer, 74: 159-164, 1996. 

103. Falconer, J. S* Ross, J. A., Fearon, K. C, Hawkins, RA^O'RIordatri, 
M. G ., and Carter, D. C. Effect of elcosaporrtaenolo acid and other fatty 
acids on the growth In vitro of human pancreatic cancer ceB fines. Br. J* 
Cancer, 69; 826-832, 1994. 

104. Noguchi, M., Mmami, M, YagasaW, a, Ksioshita, It. Earasht, M., 
KUagawa, H. f Tanfya, T, and MiyazaJd, I. Chemoprevention of D MBA- 
Induced mammary carefnogenesb In rata by low-dose EPA and DHA. 
Br. J. Cancer, 75: 348-353, 1997. 

105. PaJakurthi, S. S., RucWger, R, Aktaa, H., Changolkar, A. It, Shah- 
safael, A., Hameft; $., Kffic, E, and Halperin, «l. A. inhibition of translation 
initiation mediates the anticancer effect of the n-3 polyunsaturated nitty 
add eipc^entaenolc add. Cancer Res^ 60: 2919-^925, 2000. 

106. Aktas,H^Fludo^,R^Acos^J.A, 

and Halperin, J. A. Depletion of WracefeiIarCa^ston39,pho^ 
eiF2a,andsusfatoedlrihBi^ 

effects of dotrtnazole. Proa Nati AcaaScl USA, 95c B280^8285t 1998. 

107. MhasWlkar, A. M. t Schrock, R D., Hind), M.. Uao, J., Sieger, K., 
Kourouma, F„ Zoo-Yang. X H^ Ontehl, E, Takh, O., Vedvtek, T. S., 
Fanger, G., Stewart, L, Watson, G. J., Snary, D, Fisher, P. R, Saeki, T., 
Roth, X A., Ramesh, R, and Chada, S. Melanoma diflerentiation associ- 
ated gene-7 (rnda-7): a novel antHumor gene for cancer gene therapy. 
Mot Med., 7: 271-282. 2001 . 

108. Su.Z.Z..Madfreddi,M.T^Un,J. J., Young, C. S^ Kitada, a, Reed, 
J. C, Goldstein, N. U and Fisher, P. B. The cancer growth suppressor 
gene mda-7 selectively induces epoptosls in human breast cancer cells . 
and Inhibits tumor growth In nude mice. Proc. Natl Acad. ScL USA, 95: 
1440QM4405, 1898. 

109. Saeki, T., Mhashtlkar; A., Chada, S., Branch, C, Roth, J. A, and 
Ramesh, R. Tumor-suppesslve effects by adenovfrus-medtated mda-7 
gene transfer In non-smaD ceil lung cancer ceU ft? vflro. Gene Ther,, 7: 
2051^057,2000. ' 

110. Pataer, A, Vorburger, a A, Berber, a M, Chada, S.. Mhashilkar, 
A M4 Zou-Yang, H^ Stewart A L, Balachandran^ S^ Rom, J. A, Hunt 
K. It, and Swisher, & a Adenoviral transfer of the melanoma different* 
atkm-assocfated gene 7 (pidaT) Induces apoptosls of lung cancer cells via 
up-fegulation of the double-stranded RNA-dependent protein kinase 
(PKR). Cancer Res., 62: 2239-2243, 2002. 

111. ito, T., Wamkea S. P.. and May, W. S. Protein synthesis Inhibition by 
flavonolds: roles of eukaryotic Initiation factor 2a kinases. Blochem. Blo- 
phys. Res. Commun^ 255: 589-694, 1999. 

1 1 2. Eberte, Fecker, L F., Bittner, J. U.. Orfanos, C. E. and Geiten, C C. 
Decreased rjroliferation of human melanoma cell lines caused by antJsense 
RNA against translation factor elF^AI . Br. J. Cancer, 88: 1957-1962, 2002. 

1 1 3. Poiunovsky, V. A, Glngras, A C Sonenberg, N ., Peterson, M., Tan, A, 
Rubins, J. R, Manhrai, J. C, and Bitterman, P. RTransIational control of the 
antiapoptotic function of Ras. J. Biol Cham., 275: 24776-24780, 2000. 

114. 0'Ojr^J.iKraizxetM.CU 

P. B., and Kratzke, R A Over-expression of the translations! repressor 
4E-BP1 inhibits NSCLC tumongenlcrty In vivo. Proa Am. Assoc Cancer Res.. 
43: 818-817, 2002. 

1 1 5. DeFatta, R J., Nathan, C. A, and Oe Benedettl, A. Antisense RNA to 
elF4E suppresses oncogenic rjroperttes of a head and neck squamous 
ceil carcinoma cell Una Laryngoscope, 110: 928-933. 2000. 

116. Herbert, T. P^ Fahraeus, R, Prescott, A, Lane, D. P., and Proud, 
C. a Rapid Induction of apoptosls mediated by peptides mat bind Initi- 
ation factor elF4E Cum Biol., 10: 793-798, 2000. 

117. DeFatta, R. J., U, Y^ and De Benedetti, A Selective klffing of cancer 
cells based on transitional control of a suicide gene. Cancer Gene Ther., 
9:673-578,2002. 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

13 IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
13 FADED TEXT OR DRAWING 
13 BLURRED OR ILLEGIBLE TEXT OR DRAWING 
13 SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

13 LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



